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Dwp Mutants 

Cross-Reference to Related Application 
This application is related to provisional patent application serial no. 60/179,901, 
filed February 2, 2000, fi"om which priority is claimed under 35 USC §1 19(e)(1) and 
which is incorporated herein by reference in its entirety. 

Technical Field 

The present invention relates generally to plants that display altered structure or 
morphology and to the genes imparting such pheontypes. In particular, the present 
invention pertains to Dwarf/ (dwf7) mutants and methods of using the same. 

Background of the Invention 

Sterols are knovra to play at least two critical roles in plants: as bulk components 
of membranes regulating stability and permeability (Bach et al. (1997) Prog. Lipid Res. 
36:197-226) and as precursors of growth-promoting brassinosteroids (BRs; Fujioka and 
Sakurai (1997) Nat. Prod. Rep. 14:1-10). Lesions in brassinosteroid (BR) biosynthetic 
genes result in characteristic dwarf phenotypes in plants. Understanding the regulation of 
BR biosynthesis demands continued isolation and characterization of mutants 
corresponding to the genes involved in BR biosynthesis. 

Sterol biosynthesis in plants has been studied extensively through enzyme 
purification or gene cloning (Grunwald (1975) Annu. Rev. Plant Physiol. 26:209-236; 
Goodwin (1979) Annu. Rev. Plant Physiol. 30:369-404; Benveniste (1986) Annu. Rev. 
Plant PhycioL 37:275-308; B?.cb ^*nH Renveniste (1997) Prog. Lipid Res. 36:197-226). 
Figure 1 shows the proposed biosynthetic pathway from squalene to brassinolide (BL). A 
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major difference between photosynthetic and nonphotosynthetic organisms is that 
cyclization of squalene 2,3-oxide is bifurcated to a different route for each system 
(Benveniste (1986) Annu. Rev. Plant Physiol. 37:275-308). In animals and yeast, 
squalene 2,3-oxide is cyclized to lanosterol, whereas in photosynthetic organisms it is 
5 cyclized to cycloartenol (Nes and McKean (1977) Biochemistry of Steroids and Other 
Isopentenoids. (Baltimore, MD: University Park Press)). Accordingly, photosynthetic 
organisms require somewhat different biosynthetic enzymes, such as cycloartenol 
synthase (Corey et al. (1993) Proc. Natl. Acad. Sci. USA 90:11628-11632) and 
cycloeucalenol-obtusifoliol isomerase, which are required to open the cyclopropane ring 

10 in cycloartenol (Figure 1). However, most of the enzymatic steps are shared between the 

two different pathways. 

In plants, sterols are subject to a series of modifications before conversion to BL. 
Different sterols, such as 24-methylenecholesterol (24-MC), campesterol (CR), 
isofucosterol, and sitosterol, are converted to the BL congeners dolicholide, BL, 

15 28-homodolicholide, and 28-homoBL, respectively, in a species-specific manner (Fujioka 

et al. (1997) Plant Cell 9:1951-1962; Sasse (1997) Physiol. Plant. 100:696-701). The 
BR-specific pathway diverges into the early and the late C-6 oxidation pathways. In the 
early C-6 oxidation pathway, introduction of a 6-oxo group occurs before the vicinal 
hydroxylation reactions at the side chain, whereas it occurs after these hydroxylations in 

20 the late C-6 oxidation pathway (Figure 1; Choi et al. (1997) Phytochemistry 44:609-613). 

Several mutants, such as constitutive photomorphogenesis and dwarfism (cpd)^ 
deetiolated2 {det2), and dwarf4 {dwf4), have been shown to be defective in the 
BR-specific pathway (Li et al. (1996) Science 272:398-401; Li et al. (1997) Proc. Natl. 
Acad. Sci. USA 94:3554-3559; Szekeres et al. (1996) Cell 85:171-182; Choe et al. (1998) 

25 Plant Cell 10:231-243). These BR biosynthetic dwarfs share a characteristic dwarf 

phenotype, which includes short robust stems, reduced fertility, prolonged life cycle, and 
dark-green, round, and curled leaves when grown in the light. In the dark, these mutants 
exhibit short hypocotyls and expanded cotyledons, cpd {dwf3) mutants are only rescued 
by 23a-hydroxylated compounds (Szekcrcs ct zl (1996) Cell R5:171-182). The CPD 

30 gene was shown to encode a cytochrome P450 steroid hydroxylating enzyme 
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(CYP90A1). In addition, Li et al. (1996) Science 272:398-401 and Li et al. (1997) Proc. 
Natl. Acad. Sci. USA 94:3554-3559 showed that det2/dwf6 is blocked in the C-5 
reduction step. DET2 was found to be homologous to steroid 5a-reductases. Like its 
animal equivalents, DET2 successfully converted progesterone (3-oxo-A'*'^ steroid) to 
5 4,5-dihydroprogesterone in a human cell line. In addition, the human 5a-reductase gene 
effectively complemented det2 mutants (Li et al. (1997) Proc. Natl. Acad. Sci. USA 
94:3554-3559). Recently, it has been shown \haXDWF4 encodes a cytochrome P450 
whose amino acid sequence is 43% identical to CPD\ DWF4 has been named CYP90B1 
(Choe et al. (1998) Plant Cell 10:231-243). Based on results from feeding studies using 

10 BR biosynthetic intermediates, the proposed rate-limiting step of BR biosynthesis, 

22a-hydroxylation, is now known to be blocked in dwf4 mutants. 

In the plant sterol biosynthetic pathway, several of the genes have been cloned or 
identified based on heterologous expression or sequence similarity. First, Corey et al. 
(1993) Proc. Natl. Acad. Sci. USA 90: 1 1628-1 1632 isolated a cycloartenol synthase 

15 cDNA by heterologous complementation of yeast mutants lacking lanosterol synthase^In 

addition, two types of cDNAs encoding sterol methyltransferases have been isolated from 
soybean (Shi et al. (1996) J. Biol. Chem. 271:9384-9389) and Arabidopsis (Husselstein et 
al. (1996) FEBS Lett. 381:87-92). The Arabidopsis cDNA has been shown to mediate a 
second methyltransferase step leading to C29 sterols (Bouvier-Nave et al. (1997) Eur. J. 

20 Biochem. 246:518-529). For the 14a-demethylation reaction, Bak et al. (1997) Plant J. 

1 1 : 191-201 cloned the cDNA encoding the 14-ademethylase cytochrome P450 enzyme 
(CYP51) from Sorghum bicolor. Based on sequence similarity, Grebenok et al. (1997) 
Plant Mol. Biol. 34:891-896 identified an Arabidopsis sterol C-8 isomerase (GenBank 
accession number AF030357). Furthermore, an ERGOSTEROL25 (ERG25) homolog for 

25 Arabidopsis (C-4 demethylase) also has been discovered in the genome sequencing 

project (GenBank accession number AL021635). Finally, a sterol C-7 reductase has been 
cloned by heterologous expression of an Arabidopsis cDNA in yeast (Lecain et al. (1996) 
J. Biol. Chem. 271:10866-10873). 

As compared willi uie wealth of cloned genes sterol biosynthesis, only one 

30 mutant has been found in these genes. Gachotte et al. (1995) Plant J. 8:407-416 screened 
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an ethyl methanesulfonate (EMS)-induced mutant population (22,000 M2 plants) for 
mutants displaying an altered sterol profile. The screen yielded one mutant, sterol! 
(stel), whose endogenous level of C-5-desaturated sterols is reduced to 30% of that of the 
wild type. Expression of the yeast gene ERGS (the gene for sterol C-5 desaturase) in 
5 the steJ-1 mutant increased the level of C-5-desaturated sterols 1 .7- to 2.8-fold compared 

with the stel-1 control, suggesting functional conservation of the enzymes fi'om yeast and 
plants. However, visible phenotypes were not found in stel-1 plants. Thus, the authors 
hypothesized that the residual 30% level of C-5-desaturated sterols was sufficient for the 
growth of plants. 

10 A large collection of BR dwarf mutants have been characterized. Of the eight dwf 

loci identified to date, dwf3 (cpd; Szekeres et al. (1996) Cell 85:171-182), dwf4 (Choe et 
al. (1998) Plant Cell 10:231-243), and dwfS {det2; Li et al. (1996) Science 272:398-401) 
have been shown to act in the BR biosynthetic pathway, whereas dwf2 (bril) probably is 
involved in BR percepfion (Clouse et al. (1996) Plant Physiol. 1 1 1 :671-678; Li and 

15 Chory (1997) Cell 90:929-938). 

Disclosure of the Invention 

The present invention is based on the discovery of various mutants of a BR 
biosynthetic locus, designated dwarf/ (dwfT). The STEl locus in dwf7 mutants contain 

20 loss-of-fiinction mutations. Two allelic variants of dwf7 have been characterized, dwf7-l 

and dwp-2, also designated stel-2 and stel-3, respectively. A homologue of the dwf7 
mutants, HDF7, is also described herein. Feeding studies with BR biosynthetic 
intermediates and analysis of endogenous levels of BR and sterol biosynthetic 
intermediates indicate that the defective step in the dwp mutants resides before the 

25 production of 24-methylenecholesterol in the sterol biosynthetic pathway. Furthermore, 

results firom feeding studies with ^^C-labeled mevalonic acid and compactin show that the 
defective step is specifically the sterol C-5 desaturation. Sequencing of the STEl 
locus in the two dwp variants shows premature stop codons in the first {dwp-l) and the 
third {dwp-l) exons. Thus, the reduction of BP.s in dwp is due to a shortage of substrate 

30 sterols and is the direct cause of the dwarf pheno type in dwp. 
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Accordingly, in one embodiment, the present invention is directed to an isolated 
dwp polynucleotide that imparts at least one dwp mutant phenotype when expressed in a 
plant. The polynucleotide is selected from the group consisting of (a) a polynucleotide 
comprising the nucleotide sequence depicted at positions 143 to 322 , inclusive, of 
5 Figures 8A-8D; (b) a polynucleotide comprising the nucleotide sequence depicted at 

positions 143 to 1552, inclusive, of Figures 8A-8D; (c) a polynucleotide comprising a 
nucleotide sequence having at least about 70% identity to the nucleotide sequence of (a) 
or (b); (d) a fragment of (a), (b) or (c) comprising at least about 15 contiguous 
nucleotides; and (e) complements of (a), (b), (c), (d) or (e). 

10 In other embodiments, the present invention is directed to an isolated dwp 

polynucleotide that imparts at least one mutant phenotype when expressed in a 
plant. The polynucleotide is selected from the group consisting of (a) a polynucleotide 
comprising the nucleotide sequence depicted at positions 1506 to 2720, inclusive, of 
Figures lOA-lOF; (b) a polynucleotide comprising a nucleotide sequence having at least 

15 70% identity to the nucleotide sequence of (b); (c) a fragment of (a) or (b) comprising at 

least 15 contiguous nucleotides; and (d) complements of (a), (b), (c) or (d). 

In additional embodiments, the present invention is directed to recombinant 
vectors comprising the isolated dwp polynucleotides described above, and control 
elements that are operably linked to the polynucleotides whereby a coding sequence 

20 within the polynucleotides can be transcribed and translated in a host cell, and at least one 
of the control elements is heterologous to the coding sequence. Also provided are host 
cells transformed with the recombinant vectors, and methods of producing a DWF7 
polypeptide comprising providing a population of host cells as described above and 
culturing the population of cells under conditions whereby the DWF7 polypeptide 

25 encoded by the coding sequence present in the recombinant vector is expressed. 

In yet ftirther embodiments, the subject invention is directed to a transgenic plant 
comprising a polynucleotide described above, as well as methods of producing a 
transgenic plant comprising the steps of introducing a polynucleotide into a plant cell to 
produce a transformed pl^nt Cf^H; and producing a transgenic plant from the transformed 

30 plant cell. 
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In an additional embodiment, the invention is directed to a method for ahering the 
sterol composition of a plant relative to the wild-t3^e plant comprising introducing a 
polynucleotide as described above into a plant cell to produce a transformed plant cell 
and producing a transgenic plant from the transformed plant cell, wherein the transgenic 
5 plant has an altered sterol composition relative to the wild-type plant, such as an altered 

cholesterol composition relative to the wild-type plant. 

In still further embodiments, the invention is directed to isolated DWF7 
polypeptides encoded by the polynucleotides as described above. In certain 
embodiments, the polypeptide consists of the amino acid sequence depicted at positions 
10 1-60, inclusive, of Figure 9 or the amino acid sequence depicted at positions 1-230, 

inclusive, of Figure 9. In other embodiments, the polypeptide consists of the amino acid 
sequence depicted at positions 1-279, inclusive, of Figure 11. 

In other embodiments, the subject invention is directed to an isolated control 
element having at least about 70% identity to a control element found within nucleotide 
15 positions 43-142 of Figures 8A-8D, or 1-1505 of Figures lOA-lOF, a recombinant vector 

comprising the control element and a polynucleotide comprising a coding sequence 
which is heterologous to the control element, host cells transformed with the recombinant 
vector, and methods of producing a recombinant polypeptide comprising providing a 
population of the host cells and culturing the population of cells under conditions 
20 whereby the recombinant polypeptide encoded by the coding sequence present in the 

recombinant vector is expressed. 

These and other embodiments of the present invention will readily occur to those 
of ordinary skill in the art in view of the disclosure herein. 

25 Brief Description of the Figures 

Figure 1 shows the proposed BL biosynthetic pathway from squalene to BL. 
The BL biosynthetic pathway is divided into the sterol-specific pathway, squalene to 
campesterol, and the BR-specific pathway, campesterol to brassinolide. Common names 
fcr the compounds are lahpled. and proposed enzymes involved in each reaction are 

30 boxed and labeled. Genes identified by mutants are marked. The acronyms for some 
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compounds are in parentheses. In the inset, the carbon atoms of the sterol core rings and 
side chain are numbered. 

Figure 2 is a bar graph of measurements of gynoecia and stamens of wild-type, 
(ecotype Wassilewskija-2 [Ws-2]), dwp-l, and dwf4-3 plants. The dwP-1 plant displays 
5 a concomitant reduction in the length of gynoecia and stamens, whereas dwf4-3 displays a 

greater reduction in stamen length. Each data point represents the average length for five 
flowers. Standard errors are shown at each data point. Solid bars indicate the 
gynoecium and white bars denote the stamen. 

Figure 3 compares the response of Ught-grown wild-type and dwf7-l hypocotyls 
10 to different concentrations of BL. Black bars indicate results using the Wassilewskija-2 

(Ws-2) wild type and white bars dwp-l plants. The dwP-1 plant responds to 10"^ M BL 
and is completely rescued by 10'^ M BL. Error bars indicate ±SE. 

Figure 4 is a bar graph comparing wild-type and dwp-1 inflorescences treated 
with BR intermediates. The lengths of pedicels treated with water, 6-deoxoCT, 
15 22-OHCR, and BL were measured to the nearest millimeter (n > 15). The pedicels 

elongated greater than twofold in response to all the BRs tested, suggesting that the 
biosynthetic defect in dwp-1 resides before the production of CR. Error bars indicate 
±SE. 

Figiu-e 5 shows GC-MS analysis of wild-type and dwp-1 seedlings fed with 
20 ^^C-MVA in the presence of compactin, an inhibitor of MVA biosynthesis. 

Accumulation of episterol with a simultaneous decrease of downstream intermediates, 
including 24-MC and CR, predicts that the C-5 desaturation step is blocked in dwP-1 
plants. The units are in micrograms per 5 g fresh weight of tissue. The designation ND 
(not detected) means that the quantity is lower than the detection lirnit. Ws-2 is the 
25 Wassilewskija-2 wild type. 

Figure 6 is a schematic representation of the STEl gene. Comparison of cDNA 
and genomic DNA sequences revealed three exons (thick boxes) and two introns 
(horizontal bars). The single open reading frame encodes a protein of 281 amino acids. 
The dwp'2 ystel-3) mutation iz located in the first evon. changing a tryptophan to a stop 
30 codon. The dwp-I {stel-2) mutation also changes a tryptophan to a stop codon (amino 
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acid position 230). The three white boxes indicate the transmembrane domains, and the 
three histidine boxes are lightly shadowed. The figure is drawn to scale by using the 
^ GCK software (Textco, Inc., West Lebanon, NH). Bar = 120 bp. 

"^^^^ ^ Figure 7 depicts a multiple sequence alignment of DWF7/STE1 with kno^ 

5 sequences for sterol C-5 desaturases. The GenBank accession numbers for i 

sequences are M62623 (5. cerevisiae) (SEQ ID NO: ), AB004539 

(Schizosaccharomyces pombe) (SEQ ID NO: ), L40390 (C. glabrptd) (SEQ ID 

NO: ), and AF105034 (DWF7/STE1, Arabidopsis) (SEQ IDXO: ). The conserved 

transmembrane domains and histidine clusters are boxed^d labeled. The positions of 
10 the premature stop codons in dwP-1 and dwp-2 a^e^dicated with filled circles. 

Histidine residues in each conserved histidiiirm)x are identified with filled triangles. A 
consensus sequence (SEQ ID NO: > r^ shown in the bottom row of the alignment. 
Capital letters stand for residue^/<5onserved among all sequences, whereas lowercase 
letters mean >50% identipajT Dashes indicate gaps introduced to maximize alignment. 
1 5 Multiple sequence^i^mient was performed using PILEUP in the Genetics Computer 

Group soflw^r6(Madison, WI) with a gap creation penalty of 4 and a gap extension 
paramet^ of 1 . The annotation of the aligned sequences was performed using the 
. ALg^;RIPT software (Barton (1993) Protein Eng. 6:37-40). 

^""^ Figures 8A-8D depict the complete gene sequence of dwp^ denoted^ 
20 grey bar. The premature stop codons for dwf7-l and dwfTzZ-aFcrgHown with triangles at 

nucleotide positions 1552 and 322, resgectiv^IS^The coding sequence and corresponding 
amino acid sequence arg-repfes^ted by a light grey bar. The mRNA sequence is 
representedj>y^lack bar and is shown in three segments. The gene includes two introns 
(po^ki6ns 369-735 and 1042-1395) and three exons. 
25 " — Figure 9 shows the amino acid s^qii^n^^^j^^^U^T^ ^^^^^C ^o d inc nrqrm TfF^ 
designated in Figures 8A::8Dr'^flTe'polypeptide sequences corresponding to the dwp-2 
and (iw^Z-^Palleles occur at positions 1-60 and 1-230, respectively. 
^ b ' V, Figures lOA-lOF show the gene sequence qfthe^bi^^Ziiofnologue, HDF7, The 
coding sequence ^r^^ ^^^^ 




•TTTmngarninn acid sequence are shown in three segments 
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(exons), occurring at positions l506A734, 2024'2329^ ^JM6=2J2(^'eMt t Ti^m^, Tl lT 
^positions 1 - 1 505 and the 3' UTR occurs at positions 2721-2925. 
Figure 1 1 shows the amino acid sequence * 
designated in Fj^ia:es-i0A=1TJF7^ polypeptide sequence corresponding to the HDF7 
polypeptide occurs at positions 1-230 of the figure. 



Detailed Description of the Invention 

The practice of the present invention will employ, unless otherwise indicated, 
conventional methods of protein chemistry, biochemistry, recombinant DNA techniques 
and pharmacology, within the skill of the art. Such techniques are explained fully in the 
Uterature. See, e.g., Evans, et al. Handbook of Plant Cell Culture (1983, Macmillan 
PubHshing Co.); Binding, Regeneration of Plants, Plant Protoplasts (1985, CRC Press); 
Sambrook, et al.. Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); 
Methods In Enzymology (S. Colowick and N. Kaplan eds.. Academic Press, Inc.); 
Remington's Pharmaceutical Sciences, 18th Edition (Easton, Pennsylvania: Mack 
PubHshing Company, 1990). 

All publications, patents and patent applications cited herein, whether supra or 
infra, are hereby incorporated by reference in their entirety. 

It must be noted that, as used in this specification and the appended claims, the 
singular forms "a", "an" and "the" include plural referents unless the content clearly 
dictates otherwise. Thus, for example, reference to "a polypeptide" includes a mixture of 
two or more polypeptides, and the like. 

The following amino acid abbreviations are used throughout the text: 



Alanine: Ala (A) 
Asparagine: Asn (N) 
Cysteine: Cys (C) 
Glutamic acid: Glu (E) 
Histidine: His (H) 



T T ^„ n \ 



Methionine: Met (M) 



Arginine: Arg (R) 
Aspartic acid: Asp (D) 
Glutamine: Gin (Q) 
Glycine: Gly (G) 
Isoleucine: He (I) 
Lysi^fi- T ys (K^l 
Phenylalanine: Phe (F) 
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Proline: Pro (P) 
Threonine: Thr (T) 
Tyrosine: Tyr (Y) 



Serine: Ser (S) 
Tryptophan: Trp (W) 
Valine: Val (V) 



5 I. Definitions 

In describing the present invention, the following terms will be employed, and are 
intended to be defined as indicated below. 

The terms "nucleic acid molecule" and "polynucleotide" are used interchangeably 
and refer to a polymeric form of nucleotides of any length, either deoxyribonucleo tides or 

10 ribonucleotides, or analogs thereof. This term refers only to the primary structure of the 

molecule and thus includes double- and single-stranded DNA and RNA. It also includes 
known types of modifications, for example, labels which are known in the art, 
methylation, "caps", substitution of one or more of the naturally occurring nucleotides 
with an analog, intemucleotide modifications such as, for example, those with uncharged 

15 Hnkages (e.g., methyl phosphonates, phosphotriesters, phosphoamidates, carbamates, 

etc.) and with charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), those 
containing pendant moieties, such as, for example proteins (including e.g., nucleases, 
toxins, antibodies, signal peptides, poly-L-lysine, etc.), those with intercalators (e.g., 
acridine, psoralen, etc.), those containing chelates (e.g., metals, radioactive metals, boron, 

20 oxidative metals, etc.), those containing alkylators, those with modified linkages (e.g., 
alpha anomeric nucleic acids, etc.), as well as unmodified forms of the poljmucleotide. 
Polynucleotides may have any three-dimensional structure, and may perform any 
fimction, known or unknown. Nonlimiting examples of polynucleotides include a gene, a 
gene fi*agment, exons, introns, messenger RNA (mRNA), transfer RNA, ribosomal RNA, 

25 ribozymes, cDNA, recombinant polynucleotides, branched polynucleotides, plasmids, 

vectors, isolated DNA of any sequence, isolated RNA of any sequence, nucleic acid 
probes, and primers. 

A polynucleotide is typically composed of a specific sequence of four nucleotide 
bases: adenme (A); cyiusiiic (C); guanine (G); and thymine (T) (uracil (U) for thymine 

30 (T) when the polynucleotide is RNA). Thus, the term polynucleotide sequence is the 

10 
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alphabetical representation of a polynucleotide molecule. This alphabetical 
representation can be input into databases in a computer having a central processing unit 
and used for bioinformatics applications such as functional genomics and homology 
searching. 

5 Techniques for determining nucleic acid and amino acid "sequence identity" are 

known in the art. Typically, such techniques include determining the nucleotide sequence 
of the mRNA for a gene and/or determining the amino acid sequence encoded thereby, 
and comparing these sequences to a second nucleotide or amino acid sequence. In 
general, "identity" refers to an exact nucleotide-to-nucleotide or amino acid-to-amino 

10 acid correspondence of two polynucleotides or polypeptide sequences, respectively. Two 

or more sequences (polynucleotide or amino acid) can be compared by determining their 
"percent identity." The percent identity of two sequences, whether nucleic acid or amino 
acid sequences, is the number of exact matches between two aligned sequences divided 
by the length of the shorter sequences and multiplied by 100. An approximate alignment 

15 for nucleic acid sequences is provided by the local homology algorithm of Smith and 

Waterman, Advances in Applied Mathematics 2:482-489 (1981). This algorithm can be 
applied to amino acid sequences by using the scoring matrix developed by Davhoff. Atlas 
of Protein Sequences and Structure , M.O. Dayhoff ed., 5 suppl. 3:353-358, National 
Biomedical Research Foundation, Washington, D.C., USA, and normalized by Gribskov, 

20 Nucl. Acids Res. 14(6):6745-6763 (1986). An exemplary implementation of this 

algorithm to determine percent identity of a sequence is provided by the Genetics 
Computer Group (Madison, WI) in the "BestFit" utility application. The default 
parameters for this method are described in the Wisconsin Sequence Analysis Package 
Program Manual, Version 8 (1995) (available from Genetics Computer Group, Madison, 

25 WI). A preferred method of establishing percent identity in the context of the present 

invention is to use the MPSRCH package of programs copyrighted by the University of 
Edinburgh, developed by John F. Collins and Shane S. Sturrok, and distributed by 
IntelliGenetics, Inc. (Mountain View, CA). From this suite of packages the Smith- 
Waterman aigoritiuii uaii be employed where default parameters are used for the scoring 

30 table (for example, gap open penalty of 12, gap extension penalty of one, and a gap of 



11 



2225-0003 
PATENT 



six). From the data generated the "Match" value reflects "sequence identity." Other 
suitable programs for calculating the percent identity or similarity between sequences are 
generally known in the art, for example, another alignment program is BLAST, used with 
default parameters. For example, BLASTN and BLASTP can be used using the 
5 following default parameters: genetic code = standard; filter = none; strand = both; cutoff 

= 60; expect =10; Matrix = BLOSUM62; Descriptions = 50 sequences; sort by = HIGH 
SCORE; Databases = non-redundant, GenBank + EMBL + DDBJ + PDB + GenBank 
CDS translations + Swiss protein + Spupdate + PIR. Details of these programs can be 
found at the following intemet address: http://www.ncbi.nlm.gov/cgi-bin/BLAST. 

10 Altematively, the degree of sequence similarity between polynucleotides can be 

determined by hybridization of polynucleotides under conditions that form stable 
duplexes between homologous regions, followed by digestion with single-stranded- 
specific nuclease(s), and size determination of the digested fi:-agments. Two DNA, or two 
polypeptide sequences are "substantially homologous" to each other when the sequences 

15 exhibit at least about 70%-85%, preferably at least about 85%-90%, more preferably at 

least about 90%-95%, and most preferably at least about 95%-98% sequence identity 
over a defined length of the molecules, or any percentage between the above-specified 
ranges, as determined using the methods above. As used herein, substantially 
homologous also refers to sequences showing complete identity to the specified DNA or 

20 polypeptide sequence. DNA sequences that are substantially homologous can be 
identified in a Southem hybridization experiment under, for example, stringent 
conditions, as defined for that particular system. Defining appropriate hybridization 
conditions is within the skill of the art. See, e.g., Sambrook et al., supra; DNA Cloning, 
supra; Nucleic Acid Hybridization, supra, 

25 The degree of sequence identity between two nucleic acid molecules affects the 

efficiency and strength of hybridization events between such molecules. A partially 
identical nucleic acid sequence will at least partially inhibit a completely identical 
sequence fi-om hybridizing to a target molecule. Inhibition of hybridization of the 
completely identical sequence can be assessed ucing hybrid i7:at.i on assays that are well 

30 known in the art (e.g., Southem blot, Northem blot, solution hybridization, or the hke, see 
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Sambrook, et al., Molecular Cloning: A Laboratory Manual, Second Edition, (1989) 
Cold Spring Harbor, N.Y.). Such assays can be conducted using varying degrees of 
selectivity, for example, using conditions varying from low to high stringency. If 
conditions of low stringency are employed, the absence of non-specific binding can be 
5 assessed using a secondary probe that lacks even a partial degree of sequence identity (for 

example, a probe having less than about 30% sequence identity with the target molecule), 
such that, in the absence of non-specific binding events, the secondary probe will not 
hybridize to the target. 

When utilizing a hybridization-based detection system, a nucleic acid probe is 

10 chosen that is complementary to a target nucleic acid sequence, and then by selection of 

appropriate conditions the probe and the target sequence "selectively hybridize," or bind, 
to each other to form a hybrid molecule. A nucleic acid molecule that is capable of 
hybridizing selectively to a target sequence under "moderately stringent" typically 
hybridizes under conditions that allow detection of a target nucleic acid sequence of at 

15 least about 10-14 nucleotides in length having at least approximately 70% sequence 

identity with the sequence of the selected nucleic acid probe. Stringent hybridization 
conditions typically allow detection of target nucleic acid sequences of at least about 10- 
14 nucleotides in length having a sequence identity of greater than about 90-95% with the 
sequence of the selected nucleic acid probe. Hybridization conditions useful for 

20 probe/target hybridization where the probe and target have a specific degree of sequence 

identity, can be determined as is known in the art (see, for example, Nucleic Acid 
Hvbridization: A Practical Approach , editors B.D. Hames and S.J. Higgins, (1985) 
Oxford; Washington, DC; IRL Press). 

With respect to stringency conditions for hybridization, it is well known in the art 

25 that numerous equivalent conditions can be employed to establish a particular stringency 

by varying, for example, the following factors: the length and nature of probe and target 
sequences, base composition of the various sequences, concentrations of salts and other 
hybridization solution components, the presence or absence of blocking agents in the 
hybridization solutions (e.g., iorinamidc, dextran sulfate; and polyethylene glycol), 

30 hybridization reaction temperature and time parameters, as well as, varying wash 
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conditions. The selection of a particular set of hybridization conditions is selected 
following standard methods in the art (see, for example, Sambrook, et al., Molecular 
Cloning: A Laboratory Manual Second Edition, (1989) Cold Spring Harbor, N.Y.). 
A "gene" as used in the context of the present invention is a sequence of 
5 nucleotides in a genetic nucleic acid (chromosome, plasmid, etc.) with which a genetic 
function is associated. A gene is a hereditary imit, for example of an organism, 
comprising a polynucleotide sequence that occupies a specific physical location (a "gene 
locus" or "genetic locus") within the genome of an organism. A gene can encode an 
expressed product, such as a polypeptide or a polynucleotide (e.g., tRNA). Alternatively, 
10 a gene may define a genomic location for a particular event/function, such as the binding 

of proteins and/or nucleic acids, wherein the gene does not encode an expressed product. 
Typically, a gene includes coding sequences, such as, polypeptide encoding sequences, 
and non-coding sequences, such as, promoter sequences, polyadenlyation sequences, 

•y transcriptional regulatory sequences (e.g., enhancer sequences). Many eucaryotic genes 

in 

■LrJ 15 have "exons" (coding sequences) interrupted by "introns" (non-coding sequences). In 

certain cases, a gene may share sequences with another gene(s) (e.g., overlapping genes). 
A "coding sequence" or a sequence which "encodes" a selected polypeptide, is a 
jr{ nucleic acid molecule which is transcribed (in the case of DNA) and translated (in the 

=3 case of mRNA) into a polypeptide, for example, in vivo when placed under the control of 

\^ 20 appropriate regulatory sequences (or "control elements"). The boundaries of the coding 

sequence are typically determined by a start codon at the 5' (amino) terminus and a 
translation stop codon at the 3' (carboxy) terminus. A coding sequence can include, but is 
not limited to, cDNA from viral, procaryotic or eucaryotic mRNA, genomic DNA 
sequences from viral or procaryotic DNA, and even synthetic DNA sequences. A 
25 transcription termination sequence may be located 3' to the coding sequence. Other 

"control elements" may also be associated with a coding sequence. A DNA sequence 
encoding a polypeptide can be optimized for expression in a selected cell by using the 
codons preferred by the selected cell to represent the DNA copy of the desired 
polypeptide coding iicquence. "Encoded by" refers to a nucleic acid sequence which 
30 codes for a polypeptide sequence, wherein the polypeptide sequence or a portion thereof 
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contains an amino acid sequence of at least 3 to 5 amino acids, more preferably at least 8 
to 10 amino acids, and even more preferably at least 15 to 20 amino acids from a 
polypeptide encoded by the nucleic acid sequence. Also encompassed are polypeptide 
sequences which are immunologically identifiable with a polypeptide encoded by the 
5 sequence. 

Typical "control elements", include, but are not limited to, transcription 
promoters, transcription enhancer elements, transcription termination signals, 
polyadenylation sequences (located 3' to the translation stop codon), sequences for 
optimization of initiation of translation (located 5' to the coding sequence), translation 

10 enhancing sequences, and translation termination sequences. Transcription promoters 

can include inducible promoters (where expression of a polynucleotide sequence operably 
linked to the promoter is induced by an analyte, cofactor, regulatory protein, etc.), 
repressible promoters (where expression of a polynucleotide sequence operably linked to 
the promoter is induced by an analyte, cofactor, regulatory protein, etc.), and constitutive 

15 promoters. For purposes of the present invention, control elements for the dwp gene are 

found in the 5* and 3' UTRs shown in Figures 8A-8B, particularly at positions 43-142 and 
1710-1890, respectively, of the figure. Control elements for HDF7 are found within the 
5' and 3* UTRs shown in Figures lOA-lOF, particularly within the region between 
positions 1-1505 and 2721-2925, respectively. 

20 A control element, such as a promoter, "directs the transcription" of a coding 

sequence in a cell when RNA polymerase will bind the promoter and transcribe the 
coding sequence into mRNA, which is then translated into the polypeptide encoded by 
the coding sequence. 

"Expression enhancing sequences" typically refer to control elements that 

25 improve transcription or translation of a polynucleotide relative to the expression level in 

the absence of such control elements (for example, promoters, promoter enhancers, 
enhancer elements, and translational enhancers (e.g.. Shine and Delagamo sequences). 

"Operably linked" refers to a juxtaposition wherein the components so described 
are in a relationsiiip penniuing them to function in their intended manner. A control 

30 sequence "operably linked" to a coding sequence is ligated in such a way that expression 
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of the coding sequence is achieved under conditions compatible with the control 
sequences. The control elements need not be contiguous with the coding sequence, so 
long as they function to direct the expression thereof. Thus, for example, intervening 
untranslated yet transcribed sequences can be present between a promoter and the coding 
5 sequence and the promoter can still be considered "operably linked" to the coding 

sequence. 

A "heterologous sequence" as used herein typically refers to a nucleic acid 
sequence that is not normally found in the cell or organism of interest. For example, a 
DNA sequence encoding a polypeptide can be obtained from a plant cell and introduced 
10 into a bacterial cell. In this case the plant DNA sequence is "heterologous" to the native 

DNA of the bacterial cell. 

The "native sequence" or "wild-type sequence" of a gene is the polynucleotide 
sequence that comprises the genetic locus corresponding to the gene, e.g., all regulatory 
and open-reading frame coding sequences required for expression of a completely 
15 fiinctional gene product as they are present in the wild-type genome of an organism. The 

native sequence of a gene can include, for example, transcriptional promoter sequences, 
translation enhancing sequences, introns, exons, and poly-A processing signal sites. It is 
noted that in the general population, wild-type genes may include multiple prevalent 
versions that contain alterations in sequence relative to each other and yet do not cause a 
20 discemible pathological effect. These variations are designated "polymorphisms" or 

"allelic variations." 

"Recombinant" as used herein to describe a nucleic acid molecule means a 
polynucleotide of genomic, cDNA, semisynthetic, or synthetic origin which, by virtue of 
its origin or manipulation: (1) is not associated with all or a portion of the polynucleotide 
25 with which it is associated in nature; and/or (2) is linked to a polynucleotide other than 

that to which it is linked in nature. The term "recombinant" as used with respect to a 
protein or polypeptide means a polypeptide produced by expression of a recombinant 
polynucleotide. 

By "vector" is mcarxt any genetic elf^^-ment. such as a plasmid, phage, transposon, 
30 cosmid, chromosome, virus etc., which is capable of transferring gene sequences to target 
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cells. Generally, a vector is capable of replication when associated with the proper 
control elements. Thus, the term includes cloning and expression vehicles, as well as 
viral vectors and integrating vectors. 

As used herein, the term "expression cassette" refers to a molecule comprising at 
5 least one coding sequence operably linked to a control sequence which includes all 

nucleotide sequences required for the transcription of cloned copies of the coding 
sequence and the translation of the mRNAs in an appropriate host cell. Such expression 
cassettes can be used to express eukaryotic genes in a variety of hosts such as bacteria, 
blue-green algae, plant cells, yeast cells, insect cells and animal cells. Under the 

10 invention, expression cassettes can include, but are not limited to, cloning vectors, 

specifically designed plasmids, viruses or virus particles. The cassettes may further 
include an origin of replication for autonomous replication in host cells, selectable 
markers, various restriction sites, a potential for high copy number and strong promoters. 
A cell has been "transformed" by an exogenous polynucleotide when the 

1 5 polynucleotide has been introduced inside the cell. The exogenous polynucleotide may 
or may not be integrated (covalently linked) into chromosomal DNA making up the 
genome of the cell. In procaryotes and yeasts, for example, the exogenous DNA may be 
maintained on an episomal element, such as a plasmid. With respect to eucaryotic cells, a 
stably transformed cell is one in which the exogenous DNA has become integrated into 

20 the chromosome so that it is inherited by daughter cells through chromosome replication. 

This stability is demonstrated by the ability of the eucaryotic cell to establish cell lines or 
clones comprised of a population of daughter cells containing the exogenous DNA. 

"Recombinant host cells," "host cells," "cells," "cell lines," "cell cultures," and 
other such terms denoting procaryotic microorganisms or eucaryotic cell lines cultured as 

25 unicellular entities, are used interchangeably, and refer to cells which can be, or have 

been, used as recipients for recombinant vectors or other transfer DNA, and include the 
progeny of the original cell which has been transfected. It is understood that the progeny 
of a single parental cell may not necessarily be completely identical in morphology or in 
genomic or total DNA complerr>P!rit to the original parent, due to accidental or deliberate 

30 mutation. Progeny of the parental cell which are sufficiently similar to the parent to be 
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characterized by the relevant property, such as the presence of a nucleotide sequence 
encoding a desired peptide, are included in the progeny intended by this definition, and 
are covered by the above terms. 

The term polynucleotide" refers to a polynucleotide derived from, or 
5 homologous to, the dwp gene. The gene encodes the protein variously referred to herein 

as DWF7, STEl and DWF7/STEL DWF7 is a A^sterol C-5 desaturase that functions in 
the brassinolide (BL) biosynthetic pathway from squalene to BL (see, Figure 1). The 
dwp polynucleotide sequence and corresponding amino acid sequence are known and 
have been described in, e.g., Gachotte et al (1996) Plant J. 9:391-398 and GenBank 

10 accession No. AFl 05034. See, also. Figures 8A-8D depicting the dwp gene sequence 

and the corresponding DWF7 amino acid sequence. As shown in Figures 8A-8D, the 
dwp gene spans the region from nucleotide positions 1-1889; the upstream 5' UTR, 
including the promoter region, spans nucleotide positions 1-142; the downstream 3' UTR 
is present from nucleotide position 1710-1889. The term as used herein encompasses a 

1 5 polynucleotide including a native sequence depicted in Figures 8A-8D, as well as 

modifications and fragments thereof 

altemtion results in a plant displaying one or more dwp phenotypic traits (described 
below) whemthe polynucleotide is expressed in a plant. Such modifications typically 

20 include deletions, additions and substitutions, to the native dwp sequence, so long as the 

mutation results\n a plant displaying a dwp phenotype as defined below. These 
modifications ma\be deliberate, as through site-directed mutagenesis, or may be 
accidental, such as wough mutations of plants which express the dwp polynucleotide or 
errors due to PGR ammification. The term encompasses expressed allelic variants of the 

25 wild-type dwp sequenc^which may occur by normal genetic variation or are produced 

by genetic engineering m^hods and which result in a detectable change in the wild-type 
dwp phenotype. Two partmilar dwp allelic variants described herein are dwp -I and 
dwp -2, Polypeptides corresm^ to these variants include about amino acids 1-60 
and 1-230, respectively, of Figw.re Q. However, the boundaries of these polypeptides may 

30 vary by 1 to 10 or more amino a^ds, or any integer therebetween. Thus, dwf7-l and 
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d^n-2 polypeptides may include, for example, amino acids 1-59 and 1-229, respectively, 
or 3-Sfi and 3-232, respectively, and so on. Also described herein is a dwf7 
polynuc^tide termed The term ''dwf7 polynucleotide" as used herein, is 

intended to^compass the HDF7 polynucleotide. This polynucleotide is shown in 
5 Figures lOA-lCff' herein. The polypeptide encoded by HDF7 is depicted at about 

positions 1-279 o!£ Figure 1 1. As with the dwf7-l and dwf7-2 polypeptides, the 
boundaries of the HDF7 polypeptide may also vary by 1 to 10 or more amino acids, or 
any integer therebetv^n. These molecules are discussed in detail below. 

The term "rf>v/7 phenotype" as used herein refers to any microscopic or 

10 macroscopic change in structure or morphology of a plant, such as a transgenic plant, as 

well as biochemical differences, which are characteristic of a dwp plant, compared to a 
progenitor, wild-type plant cultivated under the same conditions. Generally, 
morphological differences include short robust stems, reduced fertility, prolonged life 
cycle, dark-green, round, and curled leaves when grown in the light. In the dark, these 

15 plants exhibit short hypocotyls and expanded cotyledons, as compared to the wild-type 

plant. The height of such plants will typically be 75% or less of the wild-type plant, more 
typically 50% or less of the wild-type plant, and even more typically 25% or less of the 
wild-type plant, or any integer in between. Additional phenotypic morphological 
attributes of the dwp mutant are summarized in Table 1 of the examples. Biochemically, 

20 dwp hypocotyls are converted to wild-type length with the application of BL. 

A "polypeptide" is used in it broadest sense to refer to a compound of two or more 
subunit amino acids, amino acid analogs, or other peptidomimetics. The subunits may be 
linked by peptide bonds or by other bonds, for example ester, ether, etc. As used herein, 
the term "amino acid" refers to either natural and/or unnatural or synthetic amino acids, 

25 including glycine and both the D or L optical isomers, and amino acid analogs and 

peptidomimetics. A peptide of three or more amino acids is commonly called an 
oHgopeptide if the peptide chain is short. If the peptide chain is long, the peptide is 
typically called a polypeptide or a protein. Full-length proteins, analogs, and fragments 
uicieof arc cnccmpacsed by the defirition. The terms also include postexpression 

30 modifications of the polypeptide, for example, glycosylation, acetylation. 
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phosphorylation and the like. Furthermore, as ionizable amino and carboxyl groups are 
present in the molecule, a particular polypeptide may be obtained as an acidic or basic 
salt, or in neutral form. A polypeptide may be obtained directly from the source 
organism, or may be recombinantly or synthetically produced (see further below). 
5 A "DWF7" polypeptide is a polypeptide as defined above, which is derived from 

a A^sterol C-5 desaturase that ftmctions in the brassinolide (BL) biosynthetic pathway 
from squalene to BL (see. Figure 1). The native sequence of ftill-length DWF7 is shown 
in Figure 9. However, the term encompasses analogs and fragments of the native 
sequence so long as the protein fiinctions for its intended purpose. Moreover, the term 
10 "DWF7 polypeptide" is intended to encompass the HDF7 polypeptide and analogs 

thereof 

The term "DWF7 analog" refers to derivatives of DWF7 and HDF7, or fragments 
of such derivatives, that retain desired fimction, e.g., as measured in assays as described 
fiirther below. In general, the term "analog" refers to compounds having a native 

15 polypeptide sequence and structure with one or more amino acid additions, substitutions 

(generally conservative in nature) and/or deletions, relative to the native molecule, so 
long as the modifications do not destroy desired activity. Preferably, the analog has at 
least the same activity as the native molecule. Methods for making polypeptide analogs 
are known in the art and are described ftirther below. 

20 Particularly preferred analogs include substitutions that are conservative in nature, 

i.e., those substitutions that take place within a family of amino acids that are related in 
their side chains. Specifically, amino acids are generally divided into four famiUes: (1) 
acidic - aspartate and glutamate; (2) basic - lysine, arginine, histidine; (3) non-polar — 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan; and 

25 (4) uncharged polar ~ glycine, asparagine, glutamine, cysteine, serine threonine, tyrosine. 

Phenylalanine, tryptophan, and tyrosine are sometimes classified as aromatic amino 
acids. For example, it is reasonably predictable that an isolated replacement of leucine 
with isoleucine or valine, an aspartate with a glutamate, a threonine with a serine, or a 
siiiiilai conscr/ative replacement an amino acid with a structurally related amino acid, 

30 will not have a major effect on the biological activity. It is to be understood that the 
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terms include the various sequence polymorphisms that exist, wherein amino acid 
substitutions in the protein sequence do not affect the essential functions of the protein. 

By "purified" and "isolated" is meant, when referring to a polypeptide or 
polynucleotide, that the molecule is separate and discrete fi-om the whole organism with 
5 which the molecule is found in nature; or devoid, in whole or part, of sequences normally 

associated with it in nature; or a sequence, as it exists in nature, but having heterologous 
sequences (as defined below) in association therewith. It is to be imderstood that the 
term "isolated" with reference to a polynucleotide intends that the polynucleotide is 
separate and discrete fi-om the chormosome from which the polynucleotide may derive. 

10 The term "purified" as used herein preferably means at least 75% by weight, more 

preferably at least 85% by weight, more preferably still at least 95% by weight, and most 
preferably at least 98% by weight, of biological macromolecules of the same type are 
present. An "isolated polynucleotide which encodes a particular polypeptide" refers to a 
nucleic acid molecule which is substantially free of other nucleic acid molecules that do 

15 not encode the subject polypeptide; however, the molecule may include some additional 

bases or moieties which do not deleteriously affect the basic characteristics of the 
composition. 

By "fragment" is intended a polypeptide or polynucleotide consisting of only a 
part of the intact sequence and structure of the reference polypeptide or polynucleotide, 

20 respectively. The fragment can include a 3' or C-terminal deletion or a 5' or N-terminal 
deletion, or even an internal deletion, of the native molecule. A polynucleotide fragment 
oizdwp sequence will generally include at least about 15 contiguous bases of the 
molecule in question, more preferably 18-25 contiguous bases, even more preferably 30- 
50 or more contiguous bases of the dwp molecule, or any integer between 15 bases and 

25 the full-length sequence of the molecule. Fragments which provide at least one dwfJ 

phenotype as defined above are useful in the production of transgenic plants. Fragments 
are also useful as oligonucleotide probes, to find additional dwp sequences. 

Similarly, a polypeptide fragment of a DWF7 molecule will generally include at 
least aboui 10 contiguous amino acid residii'^R of the full-length molecule, preferably at 

30 least about 15-25 contiguous amino acid residues of the full-length molecule, and most 
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preferably at least about 20-50 or more contiguous amino acid residues of the full-length 
DWF7 molecule, or any integer between 10 amino acids and the full-length sequence of 
the molecule. Such fragments are useful for the production of antibodies and the like. 
By "transgenic plant" is meant a plant into which one or more exogenous 
5 polynucleotides have been introduced. Examples of means by which this can be 

accomplished are described below, and include Agrobacterium-mediated transformation, 
biolistic methods, electroporation, and the like. In the context of the present invention, 
the transgenic plant contains a polynucleotide which is not normally present in the 
corresponding wild-type plant and which confers at least one dwp phenotypic trait to the 

10 plant. The transgenic plant therefore exhibits altered structure, morphology or 

biochemistry as compared with a progenitor plant which does not contain the transgene, 
when the transgenic plant and the progenitor plant are cultivated under similar or 
. equivalent growth conditions. Such a plant containing the exogenous polynucleotide is 
referred to here as an R, generation transgenic plant. Transgenic plants may also arise 

1 5 from sexual cross or by selfmg of transgenic plants into which exogenous 

polynucleotides have been introduced. Such a plant containing the exogenous nucleic 
acid is also referred to here as an R, generation transgenic plant. Transgenic plants which 
arise from a sexual cross with another parent line or by selfmg are "descendants or the 
progeny" of a R^ plant and are generally called F„ plants or S„ plants, respectively, n 

20 meaning the number of generations. 

11. Modes of Carrying Out the Invention 

Before describing the present invention in detail, it is to be understood that this 
invention is not limited to particular formulations or process parameters as such may, of 
25 course, vary. It is also to be understood that the terminology used herein is for the 

purpose of describing particular embodiments of the invention only, and is not intended 
to be limiting. 

Although a number of compositions and methods similar or equivalent to those 
described herein can be ii Sf^H in the practice of the present invention, the preferred 
30 materials and methods are described herein. 
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The present invention is based on the morphological, biochemical, and molecular 
analysis of Arabidopsis mutants. Morphologically, J>v/7 plants display a dramatic 
reduction in the length of many different organs examined, and this size reduction is 
attributable to a defect in cell elongation. Biochemically, dwp hypocotyls are converted 
to wild-type length with the application of BL, suggesting a deficiency in BRs. In 
agreement with this, BR intermediate feeding analysis, accompanied by analysis of 
endogenous levels of BRs and sterols by using GC-SIM, indicates that dwp is defective 
specifically in the sterol C-5 desaturase step of the sterol biosynthetic pathway. 
Sequencing of the sterol C-5 desaturase gene in two allelic variants, dwP-1 and 
dwp -2^ revealed premature stop codons, suggesting loss-of- function mutations. Thus, it 
appears that a shortage of sterols leads to a drastic reduction of BR levels in mutants 
and to the characteristic dwarf phenotype. 

The molecules of the present invention are therefore useful in the production of 
transgenic plants which display at least one dwp phenotype, so that the resulting plants 
have altered structure or morphology. The present invention particularly provides for 
altered structure or morphology such as reduced cell length, extended flowering periods, 
increased size of leaves or fruit, increased branching, increased seed production and 
altered sterol composition relative wild-type plants. The DWF7 polypeptides can be 
expressed to engineer a plant with desirable properties. The engineering is accomplished 
by transforming plants with nucleic acid constructs described herein which may also 
comprise promoters and secretion signal peptides. The transformed plants or their 
progenies are screened for plants that express the desired polypeptide. 

Engineered plants exhibiting the desired altered structure or morphology can be 
used in plant breeding or directly in agricultural production or industrial 
applications. Plants having the altered polypeptide can be crossed with other altered 
plants engineered with alterations in other growth modulation enzymes, proteins or 
polypeptides to produce lines with even further enhanced altered structural morphology 
characteristics compared to the parents or progenitor plants. 
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Isolation of Nucleic Acid Sequences from Plants 

The isolation oidwp sequences from the polynucleotides of the invention may be 
accomplished by a number of techniques. For instance, oligonucleotide probes based on 
the sequences disclosed here can be used to identify the desired gene in a cDNA or 
5 genomic DNA library from a desired plant species. To construct genomic libraries, large 

segments of genomic DNA are generated by random fragmentation, e.g. using restriction 
endonucleases, and are ligated with vector DNA to form concatemers that can be 
packaged into the appropriate vector. To prepare a library of tissue-specific cDNAs, 
mRNA is isolated from tissues and a cDNA library which contains the gene transcripts is 

1 0 prepared from the mRNA. 

The cDNA or genomic library can then be screened using a probe based upon the 
sequence of a cloned gene such as the polynucleotides disclosed here. Probes may 
be used to hybridize with genomic DNA or cDNA sequences to isolate homologous 
genes in the same or different plant species. Alternatively, the nucleic acids of interest 

15 can be amplified from nucleic acid samples using ampUfication techniques. For instance, 

polymerase chain reaction (PGR) technology to amplify the sequences of the genes 
directly from mRNA, from cDNA, from genomic libraries or cDNA libraries. PGR and 
other in vitro amplification methods may also be useful, for example, to clone nucleic 
acid sequences that code for proteins to be expressed, to make nucleic acids to use as 

20 probes for detecting the presence of the desired mRNA in samples, for nucleic acid 
sequencing, or for other purposes. 

Appropriate primei"s and probes for identifying t/iv^-specific genes from plant 
tissues are generated from comparisons of the sequences provided herein. For a general 
overview of PGR see Innis et al. eds, PCT Protocols: A Guide to Methods and 

25 Applications, Academic Press, San Diego (1990). Appropriate primers for this invention 
include, for instance, those primers described in the Examples and Sequence Listings, as 
well as other primers derived from the dwf sequences disclosed herein. Suitable 
amplifications conditions may be readily determined by one of skill in the art in view of 
uic leachings herein, for example, innhiding reaction components and amplification 

30 conditions as follows: 10 mM Tris-HGl, pH 8.3, 50 mM potassium chloride, 1 .5 mM 
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magnesium chloride, 0.001% gelatin, 200 dATP, 200 dCTP, 200 ^iM dGTP, 200 

dTTP, 0.4 [iM primers, and 100 units per mL Taq polymerase; 96°C for 3 min., 30 
cycles of 96°C for 45 seconds, 50°C for 60 seconds, 72°C for 60 seconds, followed by 
72Xfor5min. 

5 Polynucleotides may also be synthesized by well-known techniques as described 

in the technical literature. See, e.g., Carruthers, et al. (1982) Cold Spring Harbor Symp. 
Quant. BioL 47:411-418, and Adams, et al. (1983) 1 Am. Chem. Soc, 105:661. Double 
stranded DNA fragments may then be obtained either by synthesizing the complementary 
strand and annealing the strands together under appropriate conditions, or by adding the 

10 complementary strand using DNA polymerase with an appropriate primer sequence. 

The polynucleotides of the present invention may also be used to isolate or create 
other mutant cell gene alleles. Mutagenesis consists primarily of site-directed 
mutagenesis followed by phenotypic testing of the altered gene product. Some of the 
more commonly employed site-directed mutagenesis protocols take advantage of vectors 

15 that can provide single stranded as well as double stranded DNA, as needed. Generally, 

the mutagenesis protocol with such vectors is as follows. A mutagenic primer, i.e., a 
primer complementary to the sequence to be changed, but consisting of one or a small 
number of altered, added, or deleted bases, is synthesized. The primer is extended in 
vitro by a DNA polymerase and, after some additional manipulations, the now 

20 double-stranded DNA is transfected into bacterial cells. Next, by a variety of methods, 

the desired mutated DNA is identified, and the desired protein is purified from clones 
containing the mutated sequence. For longer sequences, additional cloning steps are often 
required because long inserts (longer than 2 kilobases) are unstable in those vectors. 
Protocols are known to one skilled in the art and kits for site-directed mutagenesis are 

25 widely available from biotechnology supply companies, for example from Amersham 

Life Science, Inc. (Arlington Heights, 111.) and Stratagene Cloning Systems (La JoUa, 
Cahf). 
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Control elements 

Regulatory regions can be isolated from the gene and used in recombinant 
constructs for modulating the expression of the dwp gene or a heterologous gene in vitro 
and/or in vivo. As shown in Figures 8A-8D, the coding region of the dwp gene 
5 (designated by the light grey bar) begins at nucleotide position 143. The region of the 

gene spanning nucleotide positions 1-142 of Figures 8A-8D includes the dwp promoter. 
This region may be used in its entirety or fragments of the region may be isolated which 
provide the abihty to direct expression of a coding sequence linked thereto. 

Thus, promoters can be identified by analyzing the 5' sequences of a genomic 

10 clone corresponding to the <ivv/7-specific genes described here. Sequences characteristic 

of promoter sequences can be used to identify the promoter. Sequences controlling 
eukaryotic gene expression have been extensively studied. For instance, promoter 
sequence elements include the TATA box consensus sequence (TATAAT), which is 
usually 20 to 30 base pairs upstream of the transcription start site. In most instances the 

15 TATA box is required for accurate transcription initiation. In plants, fiirther upstream 

from the TATA box, at positions -80 to -100, there is typically a promoter element with a 
series of adenines surrounding the trinucleotide G (or T) N G. (See, J. Messing et al., in 
Genetic Engineering in Plants, pp. 221-227 (Kosage, Meredith and HoUaender, eds. 
(1983)). Methods for identifying and characterizing promoter regions in plant genomic 

20 DNA are described, for example, in Jordano et al. (1989) Plant Cell 1 :855-866; Bustos et 

al. (1989) Plant Cell 1 :839-854; Green et al. (1988) EMBO J. 7:4035-4044; Meier et al. 
(1991) Plant Cell 3:309-316; and Zhang et al. (1996) Plant Physiology 110:1069-1079). 

Additionally, the promoter region may include nucleotide substitutions, insertions 
or deletions that do not substantially affect the binding of relevant DNA binding proteins 

25 and hence the promoter function. It may, at times, be desirable to decrease the binding of 

relevant DNA binding proteins to "silence" or "down-regulate" a promoter, or conversely 
to increase the binding of relevant DNA binding proteins to "enhance" or "up-regulate" a 
promoter. In such instances, the nucleotide sequence of the promoter region may be 
modified by, e.g., inserting aHHitional nucleotides, changing the identity of relevant 
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nucleotides, including use of chemically-modified bases, or by deleting one or more 
nucleotides. 

Promoter function can be assayed by methods known in the art, preferably by 

measuring activity of a reporter gene operatively linked to the sequence being tested for 
5 promoter function. Examples of reporter genes include those encoding luciferase, green 

fluorescent protein, GUS, neo, cat and bar. 

Polynucleotides comprising untranslated (UTR) sequences and intron/exon 

junctions are also within the scope of the invention. UTR sequences include introns and 

5' or 3' untranslated regions ( 5' UTRs or 3' UTRs). As shown in Figure 6, the dwp 
10 gene sequence includes three exons (thick boxes) and two introns (horizontal bars). See, 

also. Figures 8A-8D for the 5' and 3' UTRs. Similarly, the HDF7 gene includes three 

exons (at positions 1506-1734, 2024-2329 and IM^-lll^, denoted by the corresponding 

protein sequence indicated) and two introns (between these exons) and 5' and 3' UTRs. 

These portions of the dwp and HDF7 genes especially UTRs, can have regulatory 
15 functions related to, for example, translation rate and mRNA stability. Thus, these 

portions of the gene can be isolated for use as elements of gene constructs for expression 

of polynucleotides encoding desired polypeptides. 

Introns of genomic DNA segments may also have regulatory functions. 

Sometimes promoter elements, especially transcription enhancer or suppressor elements, 
20 are found within introns. Also, elements related to stability of heteronuclear RNA and 

efficiency of transport to the cytoplasm for translation can be found in intron elements. 

Thus, these segments can also find use as elements of expression vectors intended for use 

to transform plants. 

The introns, UTR sequences and intron/exon junctions can vary from the native 
25 sequence. Such changes fi-om those sequences preferably will not affect the regulatory 

activity of the UTRs or intron or intron/exon junction sequences on expression, 
transcription, or translation. However, in some instances, down-regulation of such 
activity may be desired to modulate traits or phenotypic or in vitro activity. 
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Use of Nucleic Acids of the Invention to Inhibit Gene Expression 

The isolated sequences prepared as described herein, can be used to prepare 
expression cassettes useful in a number of techniques. For example, expression cassettes 
of the invention can be used to suppress endogenous dwp gene expression. Inhibiting 
5 expression can be useful, for instance, in suppressing the phenotype {e.g., dwarf 

appearance, the sterol C-5 desaturase activity) exhibited by dwp plants. 

A number of methods can be used to inhibit gene expression in plants. For 
instance, antisense technology can be conveniently used. To accomplish this, a nucleic 
acid segment from the desired gene is cloned and operably linked to a promoter such that 

10 the antisense strand of RNA will be transcribed. The expression cassette is then 

transformed into plants and the antisense strand of RNA is produced. In plant cells, it has 
been suggested that antisense RNA inhibits gene expression by preventing the 
accumulation of mRNA which encodes the enzyme of interest, see, e.g., Sheehy et al. 
(1988) Proc, Nat Acad. ScL USA 85:8805-8809, and Hiatt et al., U.S. Patent Number 

15 4,801,340. 

The nucleic acid segment to be introduced generally will be substantially identical 
to at least a portion of the endogenous gene or genes to be repressed. The sequence, 
however, need not be perfectly identical to inhibit expression. The vectors of the present 
invention can be designed such that the inhibitory effect applies to other proteins within a 

20 family of genes exhibiting homology or substantial homology to the target gene. 

For antisense suppression, the introduced sequence also need not be full length 
relative to either the primary transcription product or fully processed mRNA. Generally, 
higher homology can be used to compensate for the use of a shorter sequence. 
Furthermore, the introduced sequence need not have the same intron or exon pattern, and 

25 homology of non-coding segments may be equally effective. Normally, a sequence of 
between about 30 or 40 nucleotides and about fiill length nucleotides should be used, 
though a sequence of at least about 100 nucleotides is preferred, a sequence of at least 
about 200 nucleotides is more preferred, and a sequence of at least about 500 nucleotides 
is espf^f^ially preferred. It is to be understood that any integer between the above-recited 

30 ranges is intended to be captured herein. 



28 



2225-0003 
PATENT 



Catalytic RNA molecules or ribozymes can also be used to inhibit expression of 
dwf? genes. It is possible to design ribozymes that specifically pair with virtually any 
target RNA and cleave the phosphodiester backbone at a specific location, thereby 
functionally inactivating the target RNA. In carrying out this cleavage, the ribozyme is 
5 not itself altered, and is thus capable of recycling and cleaving other molecules, making it 

a true enzyme. The inclusion of ribozyme sequences within antisense RNAs confers 
RNA-cleaving activity upon them, thereby increasing the activity of the constructs. 

A number of classes of ribozymes have been identified. One class of ribozymes is 
derived from a number of small circular RNAs which are capable of self-cleavage and 

10 replication in plants. The RNAs replicate either alone (viroid RNAs) or with a helper 

virus (satellite RNAs). Examples include RNAs from avocado sunblotch viroid and the 
satellite RNAs from tobacco ringspot virus, lucerne transient streak virus, velvet tobacco 
mottle virus, solanum nodiflorum mottle virus and subterranean clover mottle virus. The 
design and use of target RNA- specific ribozymes is described in Haseloff et al. (1988) 

15 Mz/wre 334:585-591. 

Another method of suppression is sense suppression. Introduction of expression 
cassettes in which a nucleic acid is configured in the sense orientation with respect to the 
promoter has been shown to be an effective means by which to block the transcription of 
target genes. For an example of the use of this method to modulate expression of 

20 endogenous genes see, NapoH et al. (1990) The Plant Cell 2:279-289 and U.S. Patent 

Numbers 5,034,323, 5,231,020, and 5,283,184. 

Generally, where inhibition of expression is desired, some transcription of the 
introduced sequence occurs. The effect may occur where the introduced sequence 
contains no coding sequence per se, but only intron or untranslated sequences 

25 homologous to sequences present in the primary transcript of the endogenous sequence. 

The introduced sequence generally will be substantially identical to the endogenous 
sequence intended to be repressed. This minimal identity will typically be greater than 
about 65%, but a higher identity might exert a more effective repression of expression of 
the endogenous sequences. Substantially greater identity of more than about 80% is 

30 preferred, though about 95% to absolute identity would be most preferred. It is to be 
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understood that any integer between the above-recited ranges is intended to be captured 
herein. As with antisense regulation, the effect should apply to any other proteins within 
a similar family of genes exhibiting homology or substantial homology. 

For sense suppression, the introduced sequence in the expression cassette, needing 
5 less than absolute identity, also need not be full length, relative to either the primary 

transcription product or fully processed mRNA. This may be preferred to avoid 
concurrent production of some plants which are overexpressers. A higher identity in a 
shorter than full length sequence compensates for a longer, less identical sequence. 
Furthermore, the introduced sequence need not have the same intron or exon pattern, and 
10 identity of non-coding segments will be equally effective. Normally, a sequence of the 

size ranges noted above for antisense regulation is used. 

Use of Nucleic Acids of the Invention to Enhance Gene Expression 

In addition to inhibiting certain features of a plant, the polynucleotides of the 

15 invention can be used to increase certain features such as extending flowering, producing 

larger leaves or firuit, producing increased branching and increasing seed production. 
This can be accomplished by the overexpression of dwf7 polynucleotides. 

The exogenous dwp polynucleotides do not have to code for exact copies of the 
endogenous DWF7 and HDF7 proteins. Modified DWF7 and HDF7 protein chains can 

20 also be readily designed utilizing various recombinant DNA techniques well known to 

those skilled in the art and described for instance, in Sambrook et al., supra, 
Hydroxylamine can also be used to introduce single base mutations into the coding region 
of the gene (Sikorski et al. (1991) Meth, Enzymol 194: 302-318). For example, the 
chains can vary from the naturally occurring sequence at the primary structure level by 

25 amino acid substitutions, additions, deletions, and the like. These modifications can be 

used in a number of combinations to produce the final modified protein chain. 
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Preparation of Recombinant Vectors 

To use isolated sequences in the above techniques, recombinant DNA vectors 
suitable for transformation of plant cells are prepared. Techniques for transforming a 
wide variety of higher plant species are well known and described further below as well 
5 as in the technical and scientific literature. See, for example, Weising et al. (1988) Ann. 

Rev. Genet. 22:421-477. A DNA sequence coding for the desired polypeptide, for 
example a cDNA sequence encoding the full length DWF7 protein, will preferably be 
combined with transcriptional and translational initiation regulatory sequences which will 
direct the transcription of the sequence fi'om the gene in the intended tissues of the 

10 transgenic plant. 

Such regulatory elements include but are not limited to the promoters derived 
from the genome of plant cells (e.g., heat shock promoters such as soybean hspl7.5-E or 
hspl7.3-B (Gurley et al. (1986) Mol. Cell. Biol. 6:559-565); the promoter for the small 
subunit of RUBISCO (Coruzzi et al. (1984) EMBOJ, 3:1671-1680; Broghe et al. (1984) 

15 Science 224:838-843); the promoter for the chlorophyll a/b binding protein) or from plant 

viruses viral promoters such as the 35S RNA and 19S RNA promoters of CaMV (Brisson 
et al. (1984) Nature 310:51 1-514), or the coat protein promoter of TMV (Takamatsu et al. 
(1987) EMBO J. 6:307-3 1 1), cytomegalovirus hCMV immediate early gene, the early or 
late promoters of SV40 adenovirus, the lac system, the trp system, the TAG system, the 

20 TRC system, the major operator and promoter regions of phage A, the control regions of 

fd coat protein, the promoter for 3-phosphoglycerate kinase, the promoters of acid 
phosphatase, heat shock promoters {e.g., as described above) and the promoters of the 
yeast alpha-mating factors. 

In construction of recombinant expression cassettes of the invention, a plant 

25 promoter fragment may be employed which will direct expression of the gene in all 

tissues of a regenerated plant. Such promoters are referred to herein as "constitutive" 
promoters and are active under most environmental conditions and states of development 
or cell differentiation. Examples of constitutive promoters include the cauliflower mosaic 
virus (CmvIV) 35S tranccription initiatiori region, the T-DNA mannopine synthetase 

30 promoter (e.g., the T- or 2'- promoter derived from T-DNA of Agrobacterium 
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tumafaciens), and other transcription initiation regions from various plant genes known to 
those of skill. 

Alternatively, the plant promoter may direct expression of the polynucleotide of 
the invention in a specific tissue (tissue-specific promoters) or may be otherwise under 
5 more precise environmental control (inducible promoters). Examples of tissue-specific 

promoters under developmental control include promoters that initiate transcription only 
in certain tissues, such as fiaiit, seeds, or flowers such as tissue- or developmental-specific 
promoter, such as, but not limited to the cell promoter, the CHS promoter, the PATATIN 
promoter, etc. The tissue specific E8 promoter from tomato is particularly useful for 

10 directing gene expression so that a desired gene product is located in fruits. 

Other suitable promoters include those from genes encoding embryonic storage 
proteins. Examples of environmental conditions that may affect transcription by 
inducible promoters include anaerobic conditions, elevated temperature, or the presence 
of light. If proper polypeptide expression is desired, a polyadenylation region at the 

1 5 3*-end of the coding region should be included. The polyadenylation region can be 

derived from the natural gene, from a variety of other plant genes, or from T-DNA. In 
addition, the promoter itself can be derived from the dwp or HDF7 genes, as described 
above. 

The vector comprising the sequences (e.g., promoters or coding regions) from 
20 genes of the invention will typically comprise a marker gene which confers a selectable 
phenotype on plant cells. For example, the marker may encode biocide resistance, 
particularly antibiotic resistance, such as resistance to kanamycin, G418, bleomycin, 
hygromycin, or herbicide resistance, such as resistance to chlorosluforon or Basta. 

25 Production of Transgenic Plants 

DNA constructs of the invention may be introduced into the genome of the 
desired plant host by a variety of conventional techniques. For reviews of such 
techniques see, for example, Weissbach & Weissbach Methods for Plant Molecular 
Biology (19SS, Academic Press, N V ) Section VIII, pp. 421-463; and Grierson & Corey, 

30 Plant Molecular Biology (1988, 2d Ed.), Blackie, London, Ch. 7-9. For example, the 



32 



2225-0003 
PATENT 



DNA construct may be introduced directly into the genomic DNA of the plant cell using 
techniques such as electroporation and microinjection of plant cell protoplasts, or the 
DNA constructs can be introduced directly to plant tissue using biolistic methods, such as 
DNA particle bombardment (see, e.g., Klein et al. (1987) Nature 
5 Alternatively, the DNA constructs may be combined with suitable T-DNA flanking 

regions and introduced into a conventional Agrobacterium tumefaciens host vector. 
Agrobacterium tumefaciens-mediated transformation techniques, including disarming 
and use of binary vectors, are well described in the scientific literature. See, for example 
Horsch et al. (1984) Science 233:496-498, and Fraley et al. (1983) Proc, Natl Acad. ScL 
10 USA 80:4803. The virulence functions of the Agrobacterium tumefaciens host will direct 

the insertion of the construct and adjacent marker into the plant cell DNA when the cell is 
infected by the bacteria using binary T DNA vector (Bevan (1984) Nuc, Acid Res. 

^0 12:871 1-8721) or the co-cultivation procedure (Horsch et al. (1985) Science 

-y 227: 1229-1231). Generally, the Agrobacteriiun transformation system is used to engineer 

15 dicotyledonous plants (Bevan et al. (1982) Ann, Rev, Genet 16:357-384; Rogers et al. 

"4 (1986) Methods Enzymol. 1 18:627-641). The Agrobacterium transformation system may 

also be used to transform, as well as transfer, DNA to monocotyledonous plants and plant 

jjj cells, (see Hemalsteen et al. (1984) ^A^O 73:3039-3041; Hooykass-Van Slogteren et al. 

p (1984) Nature 311:763-764; Grimsley et al. (1987) Nature 325:1677-179; Boulton et al. 

;| 20 (1989) Plant MoL Biol 12:31-40.; and Gould et al. (1991) Plant Physiol 95:426-434). 

^~ Alternative gene transfer and transformation methods include, but are not limited 

to, protoplast transformation through calcium-, polyethylene glycol (PEG)- or 
electroporation-mediated uptake of naked DNA (see Paszkowski et al (1984) EMBO J 
3:2717-2722, Potrykus et al. (1985) Molec. Gen. Genet, 199:169-177; Fromm et al. 
25 (1985) Proc. Nat. Acad. ScL USA 82:5824-5828; and Shimamoto (1989) Nature 

338:274-276) and electroporation of plant tissues (D*Halluin et al. (1992) Plant Cell 
4:1495-1505). Additional methods for plant cell transformation include microinjection, 
silicon carbide mediated DNA uptake (Kaeppler et al. (1990) Plant Cell Reporter 
9:-415-^ 18), and microprojectile bombardment (see Klein et al. (1988) Proc, Nat, Acad, 
30 Sci, USA 85:4305-4309; and Gordon-Kamm et al. (1990) Plant Cell 2:603-618). 
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Transformed plant cells which are produced by any of the above transformation 
techniques can be cultured to regenerate a whole plant which possesses the transformed 
genotype and thus the desired phenotype. Such regeneration techniques rely on 
manipulation of certain phytohormones in a tissue culture growth medium, typically 
5 relying on a biocide and/or herbicide marker which has been introduced together with the 
desired nucleotide sequences. Plant regeneration from cultured protoplasts is described in 
Evans, et aL, "Protoplasts Isolation and Culture" in Handbook of Plant Cell Culture, pp. 
124-176, MacmilUan Publishing Company, New York, 1983; and Binding, Regeneration 
of Plants, Plant Protoplasts^ pp. 21-73, CRC Press, Boca Raton, 1985. Regeneration can 

10 also be obtained from plant callus, explants, organs, pollens, embryos or parts thereof 

Such regeneration techniques are described generally in Klee et al. (1987) Ann. Rev, of 
Plant Phys.3S:467-4S6, 

The nucleic acids of the invention can be used to confer desired traits on 
essentially any plant. A wide variety of plants and plant cell systems may be engineered 

1 5 for the desired physiological and agronomic characteristics described herein using the 
nucleic acid constructs of the present invention and the various transformation methods 
mentioned above. In preferred embodiments, target plants and plant cells for engineering 
include, but are not limited to, those monocotyledonous and dicotyledonous plants, such 
as crops including grain crops (e.g., wheat, maize, rice, millet, barley), fruit crops (e.g., 

20 tomato, apple, pear, strawberry, orange), forage crops (e.g., alfalfa), root vegetable crops 
(e.g., carrot, potato, sugar beets, yam), leafy vegetable crops (e.g., lettuce, spinach); 
flowering plants (e.g., petunia, rose, chrysanthemum), conifers and pine trees (e.g., pine 
fir, spruce); plants used in phytoremediation (e.g., heavy metal accumulating plants); oil 
crops (e.g., sunflower, rape seed) and plants used for experimental purposes (e.g., 

25 Arabidopsis). Thus, the invention has use over a broad range of plants, including, but not 

limited to, species from the genera Asparagus, Avena, Brassica, Citrus, CitruUus, 
Capsicum, Cucurbita, Daucus, Glycine, Hordeum, Lactuca, Lycopersicon, Malus, 
Manihot, Nicotiana, Oryza, Persea, Pisum, Pyrus, Prunus, Raphanus, Secale, Solanum, 
Sorghum, Triticum, Vitis, Vigna, and Zea. 
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One of skill in the art will recognize that after the expression cassette is stably 
incorporated in transgenic plants and confirmed to be operable, it can be introduced into 
other plants by sexual crossing. Any of a number of standard breeding techniques can be 
used, depending upon the species to be crossed. 
5 A transformed plant cell, callus, tissue or plant may be identified and isolated by 

selecting or screening the engineered plant material for traits encoded by the marker 
genes present on the transforming DNA, For instance, selection may be performed by 
growing the engineered plant material on media containing an inhibitory amount of the 
antibiotic or herbicide to which the transforming gene construct confers resistance. 
10 Further, transformed plants and plant cells may also be identified by screening for the 

activities of any visible marker genes (e.g., the P-glucuronidase, luciferase, B or CI 
genes) that may be present on the recombinant nucleic acid constructs of the present 
invention. Such selection and screening methodologies are well known to those skilled in 
the art. 

1 5 Physical and biochemical methods also may be used to identify plant or plant cell 

transformants containing the gene constructs of the present invention. These methods 
include but are not limited to: 1) Southern analysis or PGR amplification for detecting 
and determining the structure of the recombinant DNA insert; 2) Northern blot, SI RNase 
protection, primer-extension or reverse transcriptase-PCR amplification for detecting and 

20 examining RNA transcripts of the gene constructs; 3) enzymatic assays for detecting 

enzyme or ribozyme activity, where such gene products are encoded by the gene 
construct; 4) protein gel electrophoresis, Western blot techniques, immunoprecipitation, 
or enzyme-linked immunoassays, where the gene construct products are proteins. 
Additional techniques, such as in situ hybridization, enzyme staining, and 

25 immunostaining, also may be used to detect the presence or expression of the 

recombinant construct in specific plant organs and tissues. The methods for doing all 
these assays are well known to those skilled in the art. 

Effects of gene manipulation using the methods of this invention can be observed 
by, for example^ northern blots of the RNA (e.g., mRNA) isolated fi'om the tissues of 

30 interest. Typically, if the amount of mRNA has increased, it can be assumed that the 
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endogenous dwp gene is being expressed at a greater rate than before. Other methods of 
measuring DWF7 activity can be used. For example, cell length can be measured at 
specific times. Because dwp affects the BR biosynthetic pathway, an assay that 
measures the amount of BL can also be used. Such assays are known in the art. Different 
types of enzymatic assays can be used, depending on the substrate used and the method of 
detecting the increase or decrease of a reaction product or by-product. In addition, the 
levels of DWF7 protein expressed can be measured immunochemically, i.e., ELISA, 
RIA, EIA and other antibody based assays well known to those of skill in the art, by 
electrophoretic detection assays (either with staining or westem blotting), and sterol (BL) 
detection assays. 

The transgene may be selectively expressed in some tissues of the plant or at some 
developmental stages, or the transgene may be expressed in substantially all plant tissues, 
substantially along its entire life cycle. However, any combinatorial expression mode is 
also applicable. 

The present invention also encompasses seeds of the transgenic plants described 
above wherein the seed has the transgene or gene construct. The present invention further 
encompasses the progeny, clones, cell lines or cells of the transgenic plants described 
above wherein said progeny, clone, cell line or cell has the transgene or gene construct. 

Polypeptides 

The present invention also includes DWF7 polypeptides, including such 
polypeptides as a fusion, or chimeric protein product (comprising the protein, fragment, 
analog, mutant or derivative joined via a peptide bond to a heterologous protein sequence 
(of a different protein)). Such a chimeric product can be made by ligating the appropriate 
nucleic acid sequences encoding the desired amino acid sequences to each other by 
methods known in the art, in the proper coding frame, and expressing the chimeric 
product by methods commonly known in the art. 

In addition, DWF7 polypeptides, derivatives (including fragments and chimeric 
piOicins), irxUtantG and analogues can he chemically synthesized. See, e.g., Clark-Lewis 
et al. (1991) Biochem. 30:3128-3135 and Merrifield (1963) J. Amer. Chem. Soc, 
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85:2149-2156. For example, DWF7, derivatives, mutants and analogs can be synthesized 
by solid phase techniques, cleaved from the resin, and purified by preparative high 
performance liquid chromatography (e.g., see Creighton, 1983, Proteins, Structures and 
Molecular Principles, W. H. Freeman and Co., N.Y., pp. 50-60). DWF7, derivatives and 
5 analog that are proteins can also be synthesized by use of a peptide synthesizer. The 

composition of the synthetic peptides may be confirmed by amino acid analysis or 
sequencing (e.g., the Edman degradation procedure; see Creighton, 1983, Proteins, 
Structures and Molecular Principles, W. H. Freeman and Co., N.Y., pp. 34-49). 

10 Applications 

The present invention finds use in various applications, for example, including but 
not limited to those listed above. 

The polynucleotide sequences may additionally be used to isolate mutant dwp 
gene alleles. Such mutant alleles may be isolated from plant species either known or 

15 proposed to have a genotype which contributes to altered plant morphology. 

Additionally, such plant dwp gene sequences can be used to detect plant dwp gene 
regulatory (e.g., promoter or promotor/enhancer) defects which can affect plant growth. 

The molecules of the present invention can be used to provide plants with 
increased seed and/fiiiit production, extended flowering periods and increased branching. 

20 The molecules described herein can be used to alter the sterol composition of a plants 
thereby increasing or reducing cholesterol content in the plant. A still fiarther utility of 
the molecules of the present invention is to provide a tool for studying the biosynthesis of 
brassinosteriods, both in vitro and in vivo. 

The dwp gene of the invention also has utility as a transgene encoding a the 

25 sterol C-5 desaturation protein that mediates one or more steps in brassinosteriod 

biosynthesis which results in a transgenic plant to alter plant structure or morphology. 
The dwJ7 gene also has utility for encoding the DWF7 protein in recombinant vectors 
which may be inserted into host cells to express the DWF7 protein. Further, the dwp 
pcl^rxUcleotides of the invention may be utilized (1) as nucleic acid probes to screen 

30 nucleic acid libraries to identify other enzymatic genes or mutants; (2) as nucleic acid 
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sequences to be mutated or modified to produce DWF7 protein variants or derivatives; 
(3) as nucleic acids encoding the sterol C-5 desaturases in molecular biology 
techniques or industrial applications commonly known to those skilled in the art. 

The dwp nucleic acid molecules may be used to design antisense molecules, 
5 useful, for example, in gene regulation or as antisense primers in amplification reactions 

of dwf7 gene nucleic acid sequences. With respect to dwf7 gene regulation, such 
techniques can be used to regulate, for example, plant growth, development or gene 
expression. Further, such sequences may be used as part of ribozyme and/or triple helix 
sequences, also useful for dwf7 gene regulation. 

10 The dwf7 control element (e.g., promoter) of the present invention may be utilized 

as a plant promoter to express any protein, polypeptide or peptide of interest in a 
transgenic plant. In particular, the dwf7 promoter may be used to express a protein 
involved in brassinosteriod biosynthesis. 

The Arabidopsis DWF7 protein of the invention can be used in any biochemical 

1 5 applications (experimental or industrial) where sterol C-5 desaturation activity is 

desired, for example, but not limited to, regulation of BL synthesis, regulation of other 
sterol synthesis, modification of elongating plant structures, and experimental or 
industrial biochemical applications known to those skilled in the art. 

20 III. Experimental 

Below are examples of specific embodiments for carrying out the present 
invention. The examples are offered for illustrative purposes only, and are not intended 
to limit the scope of the present invention in any way. 

Efforts have been made to ensure accuracy with respect to numbers used (e.g., 
25 amounts, temperatures, etc.), but some experimental error and deviation should, of 

course, be allowed for. 

Restriction and modifying enzymes, as well as PGR reagents were purchased fi'om 
commercial sources, and used according to the manufacturers* directions. In the cloning 
of DNA liagmcnts, except where noted, all DNA manipulations were done according to 
30 standard procedures. See, e.g., Sambrook et al., supra. Restriction enzymes, T^ DNA 

38 



22,25-0003 
PATENT 



ligase, E. coli, DNA polymerase I, Klenow fragment, and other biological reagents were 
purchased from commercial suppliers and used according to the manufacturers' 
directions. 

5 Materials and Methods 

A. Plant Growth 

For sterile growth of Arabidopsis thaliana plants, seeds of mutants and the wild 
type were sterilized (50% Clorox and 0.005% Triton X-100) for 8 min, washed three 
times with sterile distilled water, and dried with 95% ethanol. The seeds were sprinkled 

10 on 0.8% agar-solidified media or in liquid media containing 1 x Murashige and Skoog 

(Murashige and Skoog (1962) Physiol. Plant. 15:473-497) saUs and 0.5% sucrose (pH 5.8 
with KOH). For the plants grown in the dark, the seeds on the plates were illuminated for 
3 hr (240 jimol m"^ sec"*) before being wrapped with two or three layers of aluminum foil. 
For the mature plants used for morphometric analysis and gas chromatography-selective 

15 ion monitoring (GC-SIM) studies, seeds were planted on soil (Metromix 350; Grace 

Sierra Co., Milpitas, CA) presoaked with distilled water. The flats containing the pots 
were covered with plastic wrap and cold-treated at 4 °C for 2 days before transfer to a 
growth chamber (16 hr of Ught [240 jimol m'^ sec'*] and 8 hr of dark at 22 and 21 X, 
respectively, and 75 to 90% humidity). The plastic wrap was removed after 2 to 3 days. 

20 The pots were subirrigated in distilled water or Hoagland's nutrient solution as required. 

B. Morphometric and Physiological Analysis 

At 5 weeks of age, the various morphological traits listed in Table 1 (below) were 
measured. The number of seeds per silique was determined after the plants were 

25 completely dried. Unopened siliques from each plant were selected and crushed, and the 

number of seeds was counted under a dissecting microscope. To measure the fresh and 
dry weight, the aerial parts of the plants were cut and immediately weighed to obtain the 
fresh weight; the plants were then completely dried in a 60 °C oven for 5 days before 
measuring the dry weight. Flowers were harvested immediately after petal opening. 

30 Observations on the structure of flowers were made with flowers at stage 14 (Smyth et al. 
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(1990) Plant Cell 2:755-767), which are right beneath the cluster of developing flowers at 
the shoot apices. Individual organs of a flower were separated under the dissecting 
microscope. The length of the organs was measured to a tenth of a millimeter, and the 
four longest stamens for each flower were measured and the mean value calculated. 

The anatomical studies using a scanning electronic microscope and a light 
microscope were performed as described by Azpiroz et al. (1998) Plant Cell 10:219-230. 

C. Mapping and Sequencing of the DWARF7 Locus 

The mapping of dwp was performed using simple sequence length polymorphism 
(SSLP) markers (Bell and Ecker (1994) Genomics 19:137-144). Briefly, dwp-1 mutants 
(Wassilewskija-2 [Ws-2] background) were crossed to Columbia wild-type plants. 
Genomic DNA was isolated (Dellaporta et al. 1983) from individual F2 dwarf plants. To 
locate the mutation to one of the five chromosomes, 20 individual plants were tested with 
at least two SSLP markers per chromosome. The polymerase chain reaction (PCR) 
amplified products were analyzed on 4% agarose gels in 1 x TAE buffer (40 mM 
Tris-acetate and 10 mM EDTA). Once the dwp-1 mutation was shown to be linked to 
the ngal62 marker located on chromosome 3 (recombination ratio 1 1.9%), we tested 
marker ngal 72, which maps at 2.2 centimorgans. No recombination was detected 
between the dwf7-l mutation and ngal 72 when 86 chromosomes were tested, suggesting 
that dwp-1 is linked closely to the ngal72 marker. Linkage between the markers and the 
dwarf phenotype was determined according to Koomneef and Stam (1992) Genetic 
analysis. In Methods in Arabidopsis Research, C. Koncz, N.-H. Chua, and J. Schell, eds 
(Singapore: World Scientific Publishing Co.), pp. 83-99. 

PCR products amplified using primer sets derived from the cDNA sequencej^f^ 
STEROLl (STEl) were subjected to sequencing. To design sets ofprijnerg^^t do not 
fall in exon-intron junctions, we predicted possible^)liee"§i!esby using the RNASPL 
program available at the internet siteotBa5^ College of Medicine (Houston, TX; 
http://dot.imgen.bcm.tmc^5d«'^'^31/seq-search/gene-sea^^ Primers were designed 

uMiig uic Prim^istection software of DNAstar (DNASTAR Inc., Madison, WI). 
OHgonucle^tide sequences 5' to 3* are CAGTGTGAGTAAT T TAGCAT TACTA 
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(S5D_FF), GGAAAGATCATC-AAACAT T TACATGT (S5D_LR), GCGCAATCT^ 
TCT T TCGT T T (S5D_1F), TGGACAACAACAACACAAGA (S5D_1RV--^ 
GATGCACAGAGAGCT- TCATGAC (S5D_2F), CCGGCAAATpG^AGAGTGTAT 
(S5D_2R), CACCCATCATATCTACAACAA (S5D_3F},.arI^CATCT T T 
5 TGCCG-GCGAATCTAT (S5D_4F) (underlines>^€feadded to distinguish forward or 

reverse primers from the gene acronyni^50^ Primers were purchased from Genosys 
Biotechnologies, Inc. (The Wo9dl'^s, TX). For template DNA, genomic DNA was 
isolated from two or tb:©c^aves of dwf7-l and wild-type plants according to the method 
described by Kpyganet al. (1996) Proc. Natl. Acad. Sci. USA 93:8145-8150. 

10 Amplifipafion of the DNA fragment spanning the whole coding region was performed 

^j/im the S5D_4F and S5D_1R primer set with Taq polymerase (Boehringer Mannheim). 

Standard PGR reaction mixtures, 1 x PGR buffer (10 mM Tris-HCl, 1.5 mM 
MgCl2, and 50 mM KCl, pH 8.3), 0.2 )iM each of forward and reverse primer, 0.2 mM 
each deoxynucleotide triphosphates, 1 ng of genomic DNA, and 2 units of Taq 

15 polymerase were subjected to a PGR program consisting of an initial denaturation at 

95 °C for 2 min and then for 35 cycles (95 °G for 30 sec, 56 °C for 30 sec, and 72 °G for 
2.5 min ), with a final elongation step of 7 min at 72 °C. PGR-amplified DNA was 
size-separated on 0.8% agarose gels in 1 x TAB, and the resulting DNA bands were 
gel-purified using a DNA purification kit (Bio-Rad). The concentration of the extracted 

20 DNA was measured by comparing the band intensity with a DNA mass standard 

(Bethesda Research Laboratories). Sequencing of the DNA was performed at the Arizona 
Research Laboratory (University of Arizona, Tucson). DNA sequence analysis was 
conducted using software packages, including one fi'om Genetics Computer Group 
(Madison, WI) and other database search tools available on the Intemet. 

25 The base change in dwf7-l eliminated the recognition site for a restriction enzyme 

Haelll by converting the sequence from GGCC to AGCC. Thus, we utilized this 
polymorphism to test the cosegregation of the dwarf phenotype with the mutation. The 
0.8 kb of DNA spanning the mutation was amphfied using S5D_3F and S5D_1R primers 
from 17 different dwarf plants fi"om the mapping lines. Two microliters from each 20 |liL 

30 of PCR-amplified DNA was digested with the restriction enzyme Haelll (Boehringer 
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Mannheim). After complete digestion, the samples were resolved on a 2% agarose gel in 
IxTAE buffer. 

^ \ Genomic DNA sequence flanking the cDNA was identified by sequencing 
products obtained from thermal asymmetric interlaced PGR (TAIL PGR) (Lm-^fal. 
(1995) Plant J. 8:457-463). Two sets of primers were used to amplify t);*^ and 3' 
flanking DNA. Oligonucleotide sequences 5* to 3' are 

GTAGAAGGAGGAGAGGAAAGGGGAGATGAAGT (m^; melting temperature of 
69 X), AAGTATAGTAGGGT TGGGGGGAGG-TA07-5-2; melting temperature of 
64 X), ATAGAT TGGGGG-GCAAAAGATGApfC (D7-5-3; melting temperature of 
63 °G), TGG-AGGATAGGATAGGATAGAG^AGAGGACAT (D7-3-1; melting 
temperature of 68°G), GATAGGATAp^(CGAGAGGAGATAGAAGGAT-AAGTA 
(D7-3-2; melting temperature of 67^), and ATATGGATG-GAT TGGATGT T 
TGGGTGTG (D7-3-3; meltip^emperature of 63°G). The melting temperature of each 
primer was calculated w«i the formula 69.3 + 0.41 (%GG) - 650/L (Mazars et al. (1991) 
Nucleic Acids Res. 1^4783), where L is length of primer. Arbitrary degenerate primers 
ADl, AD2, mdAD3 were synthesized according to the sequence described by Liu et al. 
(1995) PlapTX 8:457-463. TAIL PGR was performed according to the program 
origiimHy described by Liu et al. 1995. TAIL PGR-amphfied DNA was separated on 1% 
agarose gels and gel extracted for sequencing. 

D. Feeding Experiments 

Biochemical complementation of dwp-1 plants with different concentrations of 
brassinolide (BL) was performed in Hquid media. BL-supplemented (control, 10'^, 10'^, 
and 10"^ M) sterile hquid media (1.5 mL) was dispensed into wells of a 24-well plate 
(Goming Co., Coming, NY). Three seedlings, germinated on agar-sohdified media, were 
transferred into each well. After a week of growth with continuous shaking (230 rpm), 
the seedlings were lightly stained with toluidine blue, and hypocotyls and roots were 
measured to the nearest millimeter. 

Feeding expenments using biosynthetic intermediates were performed with 
3-week-old mutant plants. The intermediates tested were diluted to the desired 
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# 



concentration with water containing 0.01% Tween 20. Two microliters of each 
brassinosteroid (BR) solution was applied daily to the shoot tips of plants by using a 
micro pipettman. After 1 week of treatment, total growth of inflorescence and pedicels 
was measured to the nearest millimeter (n = 15). 

5 

E. Analysis of Endogenous BRs 

Plants were grown for 5 weeks on soil. Two hundred grams of the aerial parts of 
plants, including stems, flowers, leaves, and siliques, was harvested and subjected to BR 
extraction. The procedure for extraction and analysis of BR intermediates by using 
10 GC-SIM has been described (Fujioka et al. (1997) Plant Cell 9:1951-1962). 

F. ^^C-Labeled Mevalonic Acid Feeding Experiments 

Before feeding experiments, seedlings were germinated and grown on 0.5 x 
Murashige and Skoog (Murashige and Skoog (1962) Physiol. Plant. 15:473-497) agar 

15 medium in the light at 22° C (25 mL per dish). Eight days after sowing, the seedlings 

were transferred to a 200-mL flask containing 30 mL of Murashige and Skoog 
(Murashige and Skoog (1962) Physiol. Plant. 15:473-497) media supplemented with 3% 
sucrose (Ws-2, five seedlings; dwf7-l, 40 seedHngs). 

Compactin (mevastatin; Sigma) was converted to its sodium salt as described 

20 previously (Kita et al. (1980) J. Clin. Invest. 66:1094-1 100). DL-Mevalonolactone-2-*^C 
(^^C-MVA; Isotec, Miamisburg, OH) was dissolved in methanol. Solutions of compactin 
and ^^C-MVA were added aseptically to each 200-mL flask (final concentration, 10 |xM 
compactin and 4.5 mM ^^C-MVA) just after the seedlings were transferred, and seedlings 
were allowed to grow for 1 1 days at 22°C in the light on a shaker (110 rpm). After 

25 incubation, the seedlings (~5 g fi"esh weight of both Ws-2 and dwf7-] plant materials) 

were extracted with methanol (250 mL), and the extract was partitioned between CHCI3 
and H2O. The CHClj-soluble firaction was purified with a silica cartridge column 
(Sep-Pak Vac 12 cc; Waters, Milford, MA), which was eluted with 20 mL of CHCI3. The 
eluate w?.s piirified with an octadecylsilane (ODS) cartridge column (Sep-Pak PLUS C18; 

30 Waters), which was eluted with 20 mL of methanol. The fi-action was subjected to HPLC 
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on an ODS column as follows: column, Senshu Pak ODS 4150-N (150 x 10 mm); 
solvent, methanol; flow rate, 2 mL /min; and detection, UV 205 nm. Fractions were 
collected every 0.5 min (between retention times of 10 to 20 min). Main fractions of each 
sterol were as follows: 5-dehydroepisterol (retention time of 1 1.5 to 12 min), episterol 
(retention time of 12.5 to 13 min), 24-methylenecholesterol (24-MC; retention time of 13 
to 13.5 min), 7-dehydrocampestanol (retention time of 14.5 to 15 min), and campesterol 
(CR; retention time of 15.5 to 16 min). 

Each fraction was converted to a trimethylsilyl derivative and analyzed by gas 
chromatography-mass spectrometry (GC-MS). GC-MS analyses were performed on a 
JEOL Automass JMS-AM 150 mass spectrometer (Tokyo, Japan) connected to a 
Hewlett-Packard 5890A-II gas chromatograph with a capillary column DB-5 (0.25 mm x 
15 m; 0.25-|am film thickness). The analytical conditions were the same as previously 
described (Fujioka et al. 1997). 

5-Dehydroepisterol, episterol, and 7-dehydrocampestanol were chemically 
synthesized. 

Example 1 
Isolation of dwp Mutants 
The dwP-1 mutant originally was identified in a screen of 14,000 
T-DNA-transformed lines of Arabidopsis. Genetic complementation tests with other dwf 
loci indicated that dwp belongs to a unique complementation group, dwp-l segregated 
as a monogenic recessive mutation; progeny fi-om a heterozygote segregated 325 (wild- 
type):98 {dwp-l). Although dwp-1 originated from a T-DNA mutant population, it 
failed to cosegregate with the kanamycin resistance marker in the T-DNA, suggesting 
that dwp-1 was an imtagged mutant. Furthermore, mapping the dwP-l mutation to the 
Arabidopsis genome by using simple sequence length polymorphisms (SSLPs; Bell and 
Ecker (1994) Genomics 19:137-144) confirmed that dwp maps to a location different 
from previously isolated dwarfs. The meiotic recombination ratio between dwp and the 
SSLP marker n^alll on chromosome 3 was scored as 0 / 86, indicating tight linkage of 
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dwp to ngal 72, According to a recent recombinant inbred map of Arabidopsis, ngal 72 
is located 2.2 centimorgans from the top of chromosome 3. 

A second allele of dwp was identified among 43 dwarf mutants isolated by 
screening >50,000 M2 seeds of an EMS mutant population. Similar to dwp-1, the new 
5 allele was biochemically complemented by early BR biosynthetic intermediates, 

including 22 a-hydroxycampesterol (22-OHCR) and cathasterone, and mapped near 
ngal 72, Sequencing revealed a premature stop codon in exon 1 (see below). 



Example 2 

10 Morphological Analysis of dwH-l 

dwp displays many of the characteristics of other BR dwarfs. The characteristic 
dwarf phenotype, such as short robust stems, reduced fertility, and dark-green, round, and 
curled leaves are found in the plants. Compared with 1 -month-old wild-type plants, 
dwP-1 plants grovra for 5 weeks in the light possess short robust inflorescences, 

15 dark-green, round leaves, reduced fertility, and short pedicels and siliques. The wild-type 

generally terminates flowering before 7 weeks of age; however, dwP-1 continues to 
produce flowers at this age. At 7 weeks of age, wild-type plants had ceased growing, 
whereas dwp-l plants continued to grow, indicating a prolonged life span. 

Additional morphological defects of 5-week-old light-grown plants are 

20 summarized in Table 1. Most noticeably, the height ofdwp-1 plants is strikingly 

reduced and is only 14% that of wild-type height. The leaf blade width oidwP-1 
mutants is similar to that of wild-type plants; however, the length is greatly reduced (1.8 
cm) as compared with that of the wild type (3 cm), resulting in the round shape of dwp-l 
leaves. The overall morphology oidwp-2 was similar to dwp-l except that it was 

25 slightly shorter and more sterile. 
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Table 1 . Moiphometric Analysis of Wild-Type and c/iv/7-iPlants at 5 Weeks of Age 


Measurement (« = 15) 


Wild Type 


dwp-1 


Inflorescence 
Height (cm) 

Number of inflorescences 


31.6 ±0.9 
3.9 ±0.6 


4.5 ±0.4 
4.3 ±0.5 


Reproductive organs 
Number of reproductive organs 
Length of siliques (mm) 
Number of seeds^ 


130.2 ± 12.9 
14.8 ± 1.2 
49.7 ±5.1 


89.3 ± 20.9 
3.9 ±0.8 
12A±2A 


Leaf 

Number of resette leaves 
Leaf blade width (cm)'' 
Leaf blade length (cm)'' 


9.1 ± 1.2 
1.4±0.1 
3.0 ±0.3 


10.3 ± 1.9 
1.4 ±0.3 
1.8 ±0.3 


Weight 

Fresh weight (g) 
Dry weight (mg) 
Fresh weight/dry weight 


1.50±0.19 
215 ±29 
7.0 ± 0.3 


0.51 ±0.10 
53± 11 
9.7 ±0.6 



20 

""The number of sees per silique was determined after plant senescence. 
^'The second pair of rosette leaves. 

25 Because null mutations in the BR pathway result in a dwarf phenotype, as well as 

defects in skotomorphogenesis, we compared the dwf7-l mutant with other BR dwarfs for 
growth in the dark. Hypocotyl lengths from the longest to the shortest were 18 ± 1 .6 
(wild-type; units in millimeters ± SE; n = 15), 6.3 ± 0.29 {dwp-l\ 4.1 ± 0.03 
{det2/dwf6), 1.26 ± 0.09 {dwf4\ \2A ± 0.08 {cpd/dwf3\ and 1.18 ± 0.08 {bnl/dwf2\ 

30 These data indicate that dwp-1 displays a less severe phenotype (35% that of wild-type 

hypocotyl length) than do other BR dwarfs (e.g., 7% of wild type in dwf4\ Choe et al. 
(1998) Plant Cell 10:231-243). Furthermore, dwf-l frequently displayed closed 
cotyledons and hooks similar to those of the wild type, whereas severe dwarfs, including 
kril/dy.p, cpd/dwf3. and dwf4^ showed expanded cotyledons and open hooks. 
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Unlike severe dwarfs, such as dwf4 and cpd, dwp-l mutants are not mechanically 
sterile. However, the average number of seeds in a silique is reduced in dwp-1 (n = 12) 
compared with that of the wild-type for reasons yet to be identified (n = 49) (Table 1). 
Scanning electron microscopy demonstrated a relationship between fertility and floral 
5 structure. In the wild type, the length of stamens was greater than or similar to that of the 

gynoecium (quantified in Figure 2), facihtating dehiscence of pollen on the stigmatic 
surface. The fertile dwp-l flower had a concomitant reduction in the size of the 
gynoecium and the stamen. Although dwp-1 flowers (Figure 2) possess stamens and 
gynoecia that are shorter than those in the wild type, the fertility of dwp-1 flowers is 

10 possible through the concomitant reduction in the length of both organs. In contrast, only 

stamen elongation was affected more severely in dwf4-3 flowers (Figwe 2). Because 
sterile dwf4-3 flowers have shorter filaments than the gynoecium, pollen dehiscence on 
the stigmatic surface is prevented. The short stamen length in dwf4 is likely to cause 
dehiscence of pollen on the ovary wall rather than on the stigmatic surface. In fact, when 

15 dwf4 pollen is transferred to either wild-type or dwp-1 stigmas, viable seeds are made. 

The common denominator for the various phenotypes found in dwp-1 mutants is 
a reduction in longitudinal growth, which could be due to either a reduced number of 
cells or a failure in cell elongation. Observations made with other BR dwarf mutants 
suggest that the number of cells is comparable in the wild type and mutants (Kauschmann 

20 et al. (1996) Plant J. 9:701-713; Nomura et al. (1997) Plant Physiol. 113:31-37; Azpiroz 

et al. (1998) Plant Cell 10:219-230). The length of cells in the epidermis, cortex, and 
xylem of dwP-1 was greatly reduced (<30% of wild type). This reduced cell size was 
converted to the length of the wild type in response to daily application of 10'^ M BL for 
1 week. Thus, the reduced organ length in dwp-l also is due to a failure of cell 

25 elongation. 

The organization of vascular bundles in wild-type and dwp-1 mutants was also 
examined. Wild-type inflorescences possessed eight vascular bundles. However, the 
number of vascular bundles was reduced to six in dwP-1. Furthermore, the spacing 
be^veen the vascular bundles in dwp-1 was irregular. In the wild type, interfascicular 

30 parenchyma cells alternated regularly with vascular bundles; however, cross-sections of 
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dwp'l showed that two vascular bundles were joined without being separated by 
parenchyma cells. Within a single vascular bundle, the size and number of xylem cells in 
dwp-1 plants generally were reduced, whereas the number of phloem cells was similar to 
or even greater than that in the wild-type. This characteristic abnormality of vascular 
bundle organization has been observed consistently in other BR dwarfs (Szekeres et al. 
(1996) Cell 85:171-182). 

Example 3 

Biochemical Complementation of dwf7-l with BL 
Figure 3 demonstrates that dwp-1 seedlings grown in BL-supplemented liquid 
media were remarkably sensitive to BL. Growth in 1 nM BL induced significant 
elongation of dwp-1 hypocotyls (160% increase), whereas the wild-type increase was , 
marginal (5%). Treatment with 10 and 100 nM BL completely rescued dwp-1 
hypocotyls to wild-type length. The strongest response of the wild type to BL was 
obtained at 100 nM (Figure 3). Higher concentrations of BL (1 nM) caused a stressed 
morphology, including inhibition of root growth and swollen, twisted, and fragile 
hypocotyls in both dwp-1 and wild-type plants. After BL treatment of dwp-1, cells in 
the treated region of the stem were similar in length to wild-type cells. 

The overall morphology of plants is dependent on three factors: cell size, shape, 
and number (Cosgrove (1997) Plant Cell 9:1031-1041). Various signals modulate these 
factors. Environmental signals, such as water, temperature, and light, are transduced to 
invoke internal hormone signals, including auxins, gibbereUins, and BRs. These signals 
then trigger the cell elongation process, including but not Umited to cell wall loosening 
by xyloglucan endotransglycosylases and expansins. Thus, a block in any of the signal 
transduction cascades from the environmental signals to the cell elongation process could 
result in dwarfism. Mutants resistant to or deficient in classic hormones, such as auxin 
(e.g., auxin resistant2 [axr2]; Timpte (1992) Genetics 138:1239-1249) and gibberellin 
{[gal to ga5 mdgai]; Koomneef and van der Veen (1980) Theor. AppL Genet, 
5S:25^-2(^?^: Koomneef et al. (1985) Physiol. Plant. 65:33-39), often result in dwarfism. 
Thus, we first tested whether dwp is either rescued by or resistant to exogenous 
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application of these hormones. Three-week-old dwp-l plants sprayed with 0.1 mM GA3 
responded, as did the wild-type (<10% increase of inflorescence height); however, GA3 
did not rescue the dwp-l phenotype. In addition, dwp-1 roots grown on indole acetic 
acid-supplemented agar media (0.1 |iM) displayed stunted morphology similar to that of 
the wild-type, suggesting that dwp-1 is not resistant to the exogenous application of 
auxin. The reduction of hypocotyl length in dwp-1 was rescued by the application of BL 
(Figure 3). Both wild-type and dwp-1 plants responded to BL, but dwp-l plants were 
hypersensitive. The length of dwp-1 hypocotyls was increased 160% in response to 1 
nM BL as compared with the untreated control, whereas the wild-type responded 
marginally (5%). In addition, application of BRs to 3-week-old dwp-l plants induced 
the growth of many different organs, including stems, leaves, siliques, petioles, and 
pedicels, suggesting that the major defect in dwp-1 is a deficiency of BL. 

Apart from a reduction in cell elongation, a deficiency of endogenous BRs 
resulted in altered organization of vascular tissue in the inflorescence. Szekeres et al, 
(1996) Cell 85:171-182 showed that the number of xylem cells in cpd was decreased as 
compared with the wild-type, whereas the number of phloem cells was increased. The 
authors reasoned that this could be due to unequal division of cambial cells. 
Furthermore, previous studies on the effects of BRs on vascular development indicated 
that BRs play a role in tracheary element formation (Clouse and Zurek (1991) Molecular 
analysis of brassinolide action in plant growth and development. In Brassinosteroids: 
Chemistry, Bioactivity and Applications, H.G. Cutler, T. Yokota, and G. Adam, eds 
(Washington DC: American Chemical Society), pp. 122-140; Iwasaki and Shibaoka 
(1991) Plant Cell. Physiol. 32:1007-101). Because BRs also have been found in the 
cambial region of pine, indicative of an important role in this tissue (Kim et al. (1990) 
Plant Physiol. 94:1709-1713), we hypothesize that the deficiency of BRs in dwarf 
mutants caused changes in cell fate in vascular cambial cells through yet unknown 
mechanisms. 

Auxins also are known to be a major factor affecting differentiation of the 
vascular svstem (Aloni (1987) Annu. Rev. Plant Physiol. 38:179-204). Lincoln et al. 
(1990) Plant Cell 2:1071-1080 showed that stem cross-sections of axrl displayed altered 
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development of the vascular system. The vascular bundles in axrl mutants are located 
peripherally and are not as regularly spaced as compared with those in wild-type plants 
(Lincoln et al. (1990) Plant Cell 2:1071-1080). Furthermore, as opposed to the reduced 
number of vascular bundles in (five to seven), axrl plants possess a greater 

5 number of bundles (eight to nine) as compared with the wild type (six to eight). Thus, it 

seems that auxins and BRs play opposing roles in determining the number of vascular 
bundles. Two other assays in which auxin and BR interactions have been demonstrated 
are the rice lamina bending assay and hypocotyl hook opening bioassay. Results from 
these assays include the fact that the degree of effect caused by the combined application 

10 of auxin and BR was greater than was the sum of the effect of each, indicative of a 

synergistic effect of the two hormones (Yopp et al. (1981) Physiol. Plant. 53:445-452; 
Takeno and Pharis (1982) Plant Cell Physiol. 23:1275-1281 reviewed in Mandava (1988) 
Annu. Rev. Plant Physiol. Plant Mol. Biol. 39:23-52). However, the details of the 
mechanisms for interactive and independent action remain to be elucidated. 

15 It needs to be pointed out that hypocotyl growth in darkness is accomplished 

through both GA- and BR-dependent cell elongation processes. One piece of evidence 
for dependence on both GA and BR is that dwf/-l hypocotyls elongated fivefold in 
response to darkness as compared with light-grown hypocotyls, although they are still 
shorter than those of the wild-type. Because BL levels are not detectable in dw^-l plants 

20 (Table 2), growth of dwP-1 in the dark could be accomplished mostly by GA-dependent 

cell elongation processes. Peng and Harberd (1997) Plant Physiol. 1 13:1051-1058 and 
Azpiroz et al. (1998) Plant Cell 10:219-230 found that both gai and dwf4, respectively, 
partially suppressed the stem elongation phenotype of a light receptor mutant, hy, 
suggesting that hypocotyl elongation in the absence of light inhibition requires 

25 independent growth contributed by both GA and BRs. 
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Example 4 

Identification of the BR Biosvnthetic Defect in dwfJ-1 
Biochemical complementation of dw^-l following application of BL suggested 
that dwP'l is likely to be defective in BR biosynthesis. To pinpoint the defective step in 
5 the BR biosynthetic pathway, dwP-1 mutants were treated with BR biosynthetic 

intermediates. Due to undetectable bioactivity of some early intermediates (CR to 
6-oxocampestanol) in bioassays (Fujioka et al. 1995; Choe et al. (1998) Plant Cell 
10:23 1-243), these were not used. Instead, three biologically active compounds were 
chosen, 22-OHCR, 6-deoxoCT, and BL, for these feeding tests (see Figure 1). Because 

10 the 22a-hydroxylation reaction is reported to be mediated by DWF4 (Choe et al. (1998) 

Plant Cell 10:231-243), biochemical complementation of c/m/ mutants other than dwf4 by 
22-OHCR places the defective step upstream of CR. 

Complementing compounds induced growth of intemodes and strongly increased 
pedicel length. The dwp-1 pedicels treated with 22-OHCR and BL showed growth 

1 5 greater than or equal to that of the wild-type. Measurements of pedicel length shovm in 

Figure 4 demonstrated that the three compounds tested, 22-OHCR, 6-deoxoCT, and BL, 
all increased dw^-l pedicel length >200% as compared with the control, suggesting that 
the defective step in BR biosynthesis is located at or before the CR biosynthetic step. 
Similarly, 3 -week-old inflorescences of dwp-2 were tested with 22-OHCR, 6-deoxoCT, 

20 teasterone, and BL. All four compounds induced significant elongation of pedicels and 

intemodes, indicating that dwP-1 and dwP-2 share the same biosynthetic defect. 

As shovra in Table 2, more definitive results indicating a specific defect in BR 
biosynthesis have been obtained from gas chromatography-selective ion monitoring 
(GC-SIM) analysis of endogenous BRs and sterols in dwp-l plants. The endogenous 

25 levels of sterols, such as 24-MC, CR, and campestanol (CN), in wild-type plants, were 

3800, 32,900, and 1 140 ng/g fresh weight, respectively. However, the levels of all three 
sterols in dwp-l mutants were extremely diminished at 3.1, 1.1, and 1.4% of the wild- 
type, respectively, suggesting that the biosynthetic block is located before 24-MC. These 
data are consistent with the results of intermediate feeding studies (Figure 4). 

30 
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Table 2. Quantification of Endogenous BRs from Wild Type and dwP-1 by 
Using GC-SM 


rJKS 


wiiQ lypc 


uwj /-I 




'x son 


lis 

i 1 0 






D 1)7 




1 1 AO 


1 ^ 
10 


o-ueoxoieaSierone 




XT A*' 


6-Deoxotyphasterol 


2.3 


NA 


6-Deoxocastasterone 


4.0 


ND' 


Typhasterol 


0.27 


ND 


CS 


0.28 


0.13 


BL 


0.2 


ND 



15 ^The unit of measurement is nanograms per gram fresh weight. 

^NA, not analyzed. 

^ND, not detected. The endogenous amount of the BR is less than the detection limit 
(-0.05 ng/g fresh weight). 

20 

Further biochemical feeding studies with *^C-labeled mevalonic acid (MVA) and 
compactin, a MVA biosynthetic inhibitor, were performed to identify the specific sterol 
biosynthetic step defective in dwP-1 plants. In a preliminary experiment, the effects of 
compactin and MVA on the growth of Arabidopsis seedlings in liquid media were 

25 investigated. The growth of wild-type Arabidopsis seedlings was almost completely 

inhibited in the presence of 10 )iM compactin. The inhibition, however, was restored to 
the level of controls by the simultaneous application of 4.5 mM of MVA. Therefore, 4.5 
mM *^C-MVA and 10 |xM compactin were added to Arabidopsis seedling cultures in the 
metabolic feeding studies. After 1 1 days in culture, sterols were extracted and purified by 

30 siUca and octadecylsilane (ODS) cartridge columns and ODS-HPLC. Purified samples 

were derivaiized and analyzed by gas chrorrxatcgraphy-mass spectrometry (GC-MS). As 
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shown in Figure 5, ^^C-MVA was converted to '^Cj-episterol and subsequent sterols, such 
as *^C5-24-MC and ^^Cj-CR in the wild-type. However, the ^^Cj-S-dehydroepisterol and 
downstream compounds were not detected in dwp-1 mutants, whereas the precursor 
^^C5-episterol accumulated fourfold as compared with the wild- type. In addition, an 
5 uncommon sterol, ^^Cj-T-dehydrocampestanol (24-epifungisterol), greatly accumulated 

(Figure 5). Two lines of evidence-a failure to convert episterol to subsequent sterols, 
such as 24-MC and CR, and accumulation of 7-dehydrocampestanol in Jvv/7-i -suggest 
that the defective step in dwP-1 is the C-5 desaturation stop. 

A defect either in a biosynthetic enzyme or a factor modulating an enzymatic 

10 activity could lead to deficiency of endogenous BRs. To place dwp at a specific step in 

the proposed BR biosynthetic pathway, we first chose to perform feeding studies with BR 
biosynthetic intermediates. Rescue of dwf7'l by exogenous application of 22-OHCR 
suggests that the biosynthetic defect likely resides before the production of CR. 
Consistent with the results fi*om feeding studies, the endogenous levels of 24-MC, CR, 

15 and CN were extremely reduced in dw/Z-I (Table 2). These data indicate that the 

biosynthetic defect is before 24-MC; dwf7-l contains only 3% of 24-MC as compared 
with the wild type. When the phenotypes of dwf7-l are compared with the downstream 
biosynthetic mutant dwf4 and the BR-insensitive bril {dwfl) mutant (Clouse et al. (1996) 
Plant Physiol. 1 1 1 :67 1-678), it is obvious that dwp-l displays a weaker phenotype 

20 despite being a presumptive null mutation. This suggests that there could be an 

altemative sterol and BR biosynthetic pathway or that there are duplicate genes at 
individual steps. Providing evidence for the duplicate gene hypothesis, we recently 
cloned a homolog of the DWF7/STE1 gene (named HOMOLOG 0FDWF7, HDF7), 
shown in Figures 10 and 1 1 (GenBank Accession No. AAF32466). HDF7 is 80% 

25 identical in amino acid sequence with STEl. Similarly, Fujioka et al. 1997 reported that 

the endogenous level of CN in det2, which is defective in a step between CR and CN, is 
--10% that of the wild-type amount. The authors hypothesized that the 10% leakage 
through the defective step in det2 mutants, even in a null allele, could be associated with 
a second copy of DET2 that lightly hybridizes in DNA gel blot analyses. 
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Placing dy\^p at a single sterol biosynthetic step was accomplished through 
feeding studies with *^C-MVA and compactin. A greater than fourfold accumulation of 
episterol accompanying the absence of downstream intermediates in dwp-1 indicates that 
the sterol C-5 desaturase step is blocked in dwp. In addition, the feeding studies 
5 identified an accumulation of 7-dehydrocampestanol, which is an uncommon sterol in 

plants (Figure 5). Accumulation of this compound only in dw^-l suggests that sterol 
biosynthesis in dwp-1 could proceed to a C-24 reduction step, skipping C-5 desaturation 
as well as the next immediate C-7 reduction. The C-24 reductase seems to convert 
episterol independently of the immediate upstream enzyme. The absence of a detectable 
10 amount of C-7-reduced compounds in dwp-1 suggests that the enzymatic step is highly 

dependent on the C-5 desaturation reaction. This confirms the sequence of reactions 
originally proposed by Taton and Rahier (1991) Biochem. Biophys. Res. Commun. 
181:465-473, Taton and Rahier (1996) Arch. Biochem. Biophys. 325:279-288. 

15 Example 5 

Molecular Characterization of dwp 
An EMS-induced mutant (stel-l) of STEl encoding a sterol C-5 desaturase did 
not possess a dwarf phenotype (Gachotte et al. (1995) Plant J. 8:407-416). However, 
because it is likely that stel-l is a leaky allele, it was hypothesized that dwp- 1 might be a 

20 strong or null allele. The genomic DNA of the STEl gene was sequenced and two introns 

and three exons identified by comparing them with the published STEl cDNA sequence. 
The organization of the STEl gene is represented schematically in Figure 6. Sequencing 
the STEl locus in the dwp alleles revealed mutations. The mutations found in dwp-1 and 
dwp-2 were located in the third and the first exons, respectively. Both of the dwp alleles 

25 contained a base change firom a guanine to an adenine, converting tryptophan (TGG) to a 
stop codon (TAG in dwp-1 and TGA in dwp-2\ 

In addition to creating a stop codon, the mutation in dwP-1 eliminated a Haelll 
restriction enzyme recognition site (GGCC to AGCC). Taking advantage of this 
restriction enzjmie site change, we tested the linkage of this mutation to the dwp-1 

30 phenotype. DNAs isolated from 17 different dwarf plants fi*om a segregating F2 
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population were subjected to polymerase chain reaction (PCR) analysis by using S5D_3F 
and S5D_1R primers (imderlines were used to distinguish forward or reverse primers 
from the gene acronym S5D), and the PCR products were digested with HaellL Agarose 
gel electrophoresis definitively showed that none of the PCR products from 17 mutant 
5 templates was restricted, whereas products from wild-type templates were all restricted at 
the Haelll site. These data suggest that the creation of the premature stop codon in exon 
3 is the cause of the -conferred phenotype. 

To better understand the importance of these nonsense mutations, we analyzed the 
sequence of STEl in relation to other C-5 desaturase proteins isolated from fiingi. The 

10 STEl protein is composed of 281 predicted amino acids with a theoretical pi of 6.39 and 

molecular mass of 33 kD. Whereas yeast ERG3 (38% identical; Arthington et al. (1991) 
Gene 107:173-174; GenBank accession number M62623) is predicted to contain four 
transmembrane domains, STEl possesses three putative transmembrane domains. The 
overall amino acid sequence identities of STEl with C-5 desaturases from fission yeast 

15 (GenBank accession number AB004539) and Candida glabrata (Geber et al. (1995) 

Antimicrob. Agents Chemother. 39:2708-2717; GenBank accession number L40390) 
were 37 and 33%, respectively (gap creation weight of 4; gap extension weight of 1). In 
addition, multiple sequence alignment of STEl with the three yeast sequences, shown in 
Figure 7, revealed that the transmembrane domains and histidine clusters, which were 

20 first reported by Gachotte et al. (1996) Plant J. 9:391-398, are well conserved between the 

proteins. The three characteristic histidine boxes flank the last transmembrane domain. 
The nonsense mutations are located in the first exon {dwf7-2) and the third exon, 
immediately before the third histidine box {dwp-l\ indicating that at least one histidine 
domain is deleted in each of the dwp mutants as a result of the premature stop codons. 

25 The sterol C-5 desaturase-mediated reaction is common to both photosynthetic 

and nonphotosynthetic organisms. Many genes encoding a C-5 desaturase have been 
cloned from fimgi. First, Arthington et al. (1991) Gene 107:173-174 cloned the ERGS 
gene from Saccharomyces cerevisiae. The authors found that viable erg3 mutants, which 
normally accumulate sterols, were restored to wild-type phenotype when transformed 

30 with a wild-type genomic clone of the A^ sterol C-5 desaturase gene. Taguchi et al. 
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(1994) Microbiology 140:353-359 showed that the yeast mutant syrJ displays dual 
phenotypes, resistance to the phytotoxin syringomycin and susceptibility to higher 
concentrations of Ca^"^, presumably due to altered membranes. Sequencing the ERGS 
locus in the syri mutant revealed that syrl is an allele of ERGS. Furthermore, Geber et 
5 al. (1995) Antimicrob. Agents Chemother. 39:2708-2717 cloned both ERGS and ERGJl 

(14a-sterol-demethylase) from C. glabrata. The authors found that lethal ergU 
mutations can be suppressed by an additional mutation in ergS. They reasoned that 
formation of toxic 3P,6a-diol sterols in ergll mutants is prevented due to the defect in 
C-5 desaturation in ergll ergS double mutants. 

10 In plants, Gachotte et al. (1995) Plant J. 8:407-416 found that the Arabidopsis 

stel-1 mutant, which is deficient in C-5 desaturated sterols, can be partially 
complemented by the yeast ERGS gene. Accordingly, the authors hypothesized that 
stel-1 possesses a mutation in the sterol C-5 desaturase gene. They isolated the 
Arabidopsis C-5 desaturase gene through heterologous complementation of a yeast ergS 

15 null mutant with an Arabidopsis cDNA library (Gachotte et al. (1996) Plant J. 

9:391-398). Finally, the partial human cDNA for the C-5 desaturase has been identified 
by Matsushima et al. (1996) Cell Genet. 74:252-254. Ahgnment of the sequences of 
these enzymes revealed that C-5 desaturases from different organisms are highly 
conserved in overall sequence as well as in specific domains. The overall amino acid 

20 sequence identity and similarity among STEl and ERG3 and the human ortholog is 38% 
(50%) and 35% (47%), respectively (similarity within parentheses). As indicated in 
Figure 6 and Figure 7, key domains including the transmembrane domains and the 
histidine clusters are well conserved between all the C-5 desaturases. 

Closely spaced histidine residues, HX3H in helices, serve as typical metal binding 

25 motifs in many proteins (Regan (1993) Annu. Rev. Biophys. Biomol. Struct. 

22:257-281). Shanklin et al. (1994) Biochemistry 33:12787-12794 showed that three 
membrane-associated bacterial enzymes, fatty acid desaturase, alkane hydroxylase, and 
xylene monooxygenase, possess eight histidine residues that are conserved in three 
regions dispersed in these enzymes, HX^3.4^H, HX(2.3)HH, and HX(2.3)HH (where X stands 

30 for any amino acid). DNA constructs containing site-directed mutations at any of these 
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eight histidine residues of the rat desaturase failed to complement the yeast mutant 
olel, which is defective in the same enzymatic step, suggesting that the individual 
histidine residues are essential for the function of the enzyme. On the basis of these 
observations, Shanklin et al. (1994) Biochemistry 33:12787-12794 hypothesized that the 
5 histidine clusters conserved in these enzymes constitute new structural domains of diiron 
binding centers (Shanklin et al. (1994) Biochemistry 33:12787-12794). Gachotte et al. 
(1996) Plant J. 9:391-398 first recognized the conserved histidine clusters in STEl and 
yeast proteins. We confirmed that the motifs are highly conserved in STEl and the yeast 
ERG3 enzymes with the same context of HX3H, HXjHH, and HXjHH (Figure 7), 

10 revealing the presence of a putative iron binding motif in sterol C-5 desaturases. 

More direct evidence of metal ion involvement in A^ sterol C-5 desaturase 
fimction was obtained by Taton and Rahier (1996) Arch. Biochem. Biophys. 
325:279-288. These authors discovered that the enzyme prepared fi^om maize 
microsomes is inhibited by cyanide, whereas it is insensitive to carbon monoxide, 

1 5 indicative of the involvement of a metal ion, presumably an iron, for the proper fimction 

of the enzyme. Furthermore, we noticed that the typical histidine moiety also was 
conserved in a different group of oxidases such as RANP-1 (Uwabe et al. (1997) 
Neuroscience 80:501-509), C-4 methyl sterol oxidase (Li and Kaplan (1996) J. Biol. 
Chem. 271:16927-16933), and aldehyde decarbonylase (Aarts et al. (1995) Plant Cell 

20 7:21 15-2127). Occurrence of these histidine boxes in a wide variety of oxidases indicates 

that this domain plays a common and essential role in the fimction of membrane oxidases. 
Therefore, it is likely that the mutations in dwf7-l and dwp-2 would be deleterious to 
protein fimction. The premature stop codon in dwp-2 would eliminate all important 
known domains, whereas the third histidine box and several amino acid residues that are 

25 100% conserved in the C terminus of the protein are eliminated in dwp-L Intriguingly, 

the location of the mutations in dwp-l and dwp-2 seems to be related to the phenotypic 
severity of the mutant alleles, dwp-2, which contains an earlier stop codon, was shorter 
in height and less fertile than dwP-1. A more precise comparison between the two alleles 
is not possible because the EMS allele, dwp-2, has not been outcrossed to remove any 

30 background mutations that might have increased the severity of the phenotype of dwp-2. 
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Despite the differences in severity, both dwf7 alleles are likely complete loss-of-function 
alleles. The resulting nonfunctional enzyme causes a block in sterol biosynthesis. This 
shortage of substrate sterols in dwf7-l and dwf7-2 leads to a deficiency of endogenous 
BRs and causes the characteristic dwarfism in dwf7 plants. 

Thus, novel dwf7 mutants, as well as methods of using the same, are disclosed. 
Although preferred embodiments of the subject invention have been described in some 
detail, it is understood that obvious variations can be made without departing from the 
spirit and the scope of the invention as defined by the appended claims. 
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